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Nonholonomic systems are control systems which depend linearly on the 
control. Their underlying geometry is the sub-Riemannian geometry, which 
plays for these systems the same role as Euclidean geometry does for linear 
systems. In particular the usual notions of approximations at the first order, 
that are essential for control purposes, have to be defined in terms of this 
geometry. The aim of these notes is to present these notions of approximation 
and their link with the metric tangent structure in sub-Riemannian geometry. 

The notes are organized as follows. In Section [1] we introduce the basic 
definitions on nonholonomic systems and sub-Riemannian geometry. Sec- 
tion [2] is devoted to the study of the controllability of nonholonomic systems, 
and to the topological properties of sub-Riemannian distances. Section [3] 
provides a detailed exposition of the notions of first-order approximation, 
including nonholonomic orders, privileged coordinates, nilpotent approxima- 
tions, and distance estimates such as the Ball-Box Theorem. We then see in 
Section H] how these notions allow us to describe the tangent structure to a 
Carnot-Caratheodory space (the metric space defined by a sub-Riemannian 
distance). Finally, we present in the appendix some results on flows in con- 
nection with the Hausdorff formula (Section IA"]) . and some proofs on privi- 
leged coordinates (Section |B|) . 

1 Geometry of nonholonomic systems 

Throughout these notes we work in a smooth n-dimensional manifold M. 
However most of our considerations are local, so M can also be thought of 
as an open subset of W l . 

1.1 Nonholonomic systems 

A nonholonomic system on M is a control system which is of the form 

m 

q = ^2mXi(q), qeM, u = (u u . . . , u m ) G R m , (£) 

i=l 

where Xi, . . . , X m are C°° vector fields on M. To give a meaning to such a 
control system, we have to define what are its solutions, that is, its trajecto- 
ries. 

Definition 1.1. A trajectory of (JSJ is a path 7 : [0,T] — > M for which 
there exists a function u(-) G L 1 ([0, T], IR m ) such that 7 is a solution of the 
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ordinary differential equation: 

m 

q(t) = Ui(t)Xi(q(t)), for a.e. t G [0, T). 

i=i 

Such a function u(-) is called a control associated with 7. 

Equivalently, a trajectory is an absolutely continuous path 7 on M such 
that 7(t) G A(7(t)) for almost every t G [0,T], where we have set, for every 
qEM, 

A(q) = span {X^q), . . . , X m (q)} C T,M. (1) 

Note that the rank of the vector spaces A(g) is a function of q, which may 
be non constant. If it is constant, A defines a distribution on M, that is, a 
subbundle of TM. 

Example 1.1 (unicycle). The most typical example of nonholonomic system 
is the simplified kinematic model of a unicycle. In this model, a configuration 
q = (x, y, 9) of the unicycle is described by the planar coordinates (x, y) of the 
contact point of the wheel with the ground, and by the angle 9 of orientation 
of the wheel with respect to the x-axis. The space of configurations is then 
the manifold M 2 x S 1 . 

The wheel is subject to the constraint of rolling without slipping, which 
writes as Asm 9 — ycos9 = 0, or, equivalently as q G kercu(g), where u is the 
one-form sin9dx — cos 9 dy. Hence the set A of ([1]) is kero;. 

Choosing as controls the tangential velocity u\ and the angular velocity 
«2j we obtain the nonholonomic system q = UiXi(q) + u 2 X 2 (g) on M 2 x S 1 , 
where X 1 = cos9d x + sxn9d y , and X 2 = dg. 

Let us mention here a few properties of the trajectories of (JSJ) (for more 
details, see |Rif] ). 

• Fix p G M and T > 0. For every control u(-) G L\[0,T],R m ), there 
exists r G (0, T] such that the Cauchy problem 

/ ?(*) = YZMQXMt)) f o r a.e. t G [0,r], 

I 9(0) = p, [Z) 

has a unique solution denoted by 7^ or 7(-;p, u). It is called the trajec- 
tory issued from p associated with u. 

• If the rank of X%, . . . , X m is constant and equal to m on M, every tra- 
jectory is associated with a unique control. Otherwise different controls 
can be associated with the same trajectory. In this case it will some- 
times be useful to consider among these controls only the ones which 
minimize the L l norm J \\u(t)\\dt. By convexity, this defines a unique 
control with which the trajectory is associated. 
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• Any time-reparameterization of a trajectory is still a trajectory: if 
7 : [0,T] — > M is a trajectory associated with a control w, and a : 
[0,5] [0,T] is a C 1 -diffeomorphism, then 70a: [0, S] — > M is a 
trajectory associated with the control a'(s)w (q;(s)). In particular, one 
can reverse time along 7: the resulting path j(T — s), s G [0, T], is a 
trajectory associated with the control —u(T — s). 

In this context, the first question is the one of the controllability: can we 
join any two points by a trajectory? This suggests to introduce the following 
definition. 

Definition 1.2. The attainable set from p G M is defined to be the set A p 
of points attained by a trajectory of (E]) issued from p. 

The question above then becomes: is the attainable set from any point 
equal to the whole manifold Ml We will answer this question in Section [2j 

In the case where the answer is positive, next issues are notably the 
motion planning (i.e. find a trajectory joining two given points) and the 
stabilization (i.e. design the control as a function u(q) of the state in such a 
way that the resulting differential equation is stable). The usual way to deal 
with these problems is to use a first-order approximation of the system. The 
underlying idea is the following. Consider a nonlinear control system in R n , 

x = f(x,u), x G M n , u G M m , 

and a pair (x,u) G M. m+n such that f(x,u) = 0. The linearized system 
around this equilibrium pair is defined to be the linear control system: 

df df 
Sx = -J-(x, u)5x + -±(x, u)5u, 5x e W 1 , 5u e R m . 
ox ou 

If this linearized system is controllable, so is the nonlinear one near x. In this 
case the solutions to the motion planning and stabilization problems for the 
linearized system may be used to construct solutions of the corresponding 
problems for the nonlinear system (see for instance [KhaOl]). Thus, locally, 
the study of the control system amounts to the one of the linearized system. 

Does this strategy apply to nonholonomic systems? Consider a nonholo- 
nomic system (JS} defined on an open subset M of W 1 . For every q G M, the 
pair (q, 0) is an equilibrium pair and the corresponding linearized system is 

m 

Sx = SuiXiiq), 5x G R n , 5u G R m . 
i=i 
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For this linearized system, the attainable set from a point 5q is obviously the 
affine subset 

5q + A(q) = 5g + span{Xi(g),...,X m (g)}. 

Thus, except in the very special case where rankA(g) = n, the linearized 
system is not controllable and the strategy above does not apply, whereas 
nonholonomic systems may be controllable (and generically they are), as we 
will see Section El 

This may be explained as follows. The linearization is a first-order approx- 
imation with respect to a Euclidean (or a Riemannian) distance. However for 
nonholonomic systems the underlying distance is a sub-Riemannian one and 
it behaves very differently from a Euclidean one. Thus, the local behaviour 
should be understood through the study of a first-order approximation with 
respect to this sub-Riemannian distance, not through the linearized system. 

We will introduce now the sub-Riemannian distances. In Section [3] we 
will see how to construct first-order approximations with respect to this kind 
of distances, and how to use them for motion planning for instance. 



1.2 Sub-Riemannian distance 

A nonholonomic system induces a distance on M in the following way. We 
first define the sub-Riemannian metric associated with ([£]) to be the function 
g : TM -> 1 given by 

g{q,v) = inf j^iH + u 2 m : ^ UiXi(q) = v\ , (3) 

for q G M and v G T q M, where we adopt the convention that inf = +oo. 
This function g is smooth and satisfies: 

• g(q,v) = +oo if v & A(q), 

• g restricted to A(q) is a positive definite quadratic form. 

Such a metric allows to define a distance in the same way as in Riemannian 
geometry. 

Definition 1.3. The length of an absolutely continuous path 7(i), t G [0,T], 
is 

length( 7 ) = / y/g (i(t),i(t))dt, 
Jo 
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and the sub-Riemannian distance on M associated with the nonholonomic 
system ([£]) is defined by 



d(p, q) = inf length (7) 



where the infimum is taken over all absolutely continuous paths 7 joining p 
to q. 

Note that only trajectories of (E]) may have a finite length. In particular, 
if no trajectory joins p to q, then d(p, q) = +00. We will see below in 
Corollary 12.41 that, under an extra assumption on the nonholonomic system, 
d is actually a distance function. 

Remark 1.1. When 7 is a trajectory, its length is also equal to 



the minimum being taken over all control u(-) associated with 7. As already 
noticed, this minimum is attained at a unique control which could be defined 
as the control associated with 7. 

An important feature of the length of a path is that it is independent 
of the parametrization of the path. As a consequence, the sub-Riemannian 
distance d(p, q) may also be understood as the minimal time needed for the 
nonholonomic system to go from p to q with bounded controls, that is, 



This formulation justifies the assertion made in Section [LT} for nonholonomic 
systems, first-order approximations with respect to the time should be un- 
derstood as first-order approximations with respect to the sub-Riemannian 
distance. 

Another consequence of (jlj) is that d(p, q) is the solution of a time-optimal 
control problem. It then results from standard existence theorems (see for 
instance |LM67j or |Rifj ) that, when p and q are sufficiently close and d(p, q) < 
00, there exists a trajectory 7 joining p to q such that 





3 a trajectory 7„ : [0, T] — > M s.t. 
7«(0) = P, lu(T) = q, 
and \\u(t)\\ < 1 for a.e. t G [0,T] 




length(7) = d(p,q). 



Such a trajectory is called a minimizing trajectory. 
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Remark 1.2. Any reparameterization of a minimizing trajectory is also mini- 
mizing. Therefore any pair of close enough points can be joined by a minimiz- 
ing trajectory of velocity one, that is, a trajectory 7 such that <7(7(£)j7(0) 
I for a.e. t. As a consequence, there exists a control «(•) associated with 7 
such that || = 1 a.e. Every sub-arc of such a trajectory 7 is also clearly 
minimizing, hence the equality d(p,j(t)) = t holds along 7. 

1.3 Sub-Riemannian manifolds 

The distance d defined in Section 11.21 does not always meet the classical 
notion of sub-Riemannian distance arising from a sub-Riemannian manifold. 
Let us recall the latter definition. 

A sub-Riemannian manifold (M, D, g^) is a smooth manifold M endowed 
with a sub-Riemannian structure (D,gn), where: 

• D is a distribution on M, that is a subbundle of TM; 

• gn is a Riemannian metric on D, that is a smooth function g^ : D — y M 
whose restrictions to -D(g) are positive definite quadratic forms. 

The sub-Riemannian metric associated with (D,gn) is the function gsn '■ 
TM R given by 

„ t„ v \ - ! 9r(q,v) if v e D(q), 

9s R (q,v) - I +Qo otherwise _ (5) 

The sub-Riemannian distance dsn on M is then defined from the metric gsR 
as d is defined from the metric g in Section 11.21 

What is the difference between the two constructions, that is, between 
the definitions ([3]) and (J5J) of a sub-Riemannian metric? 

Consider a sub-Riemannian structure (D,gn). Locally, on some open 
subset U, there exist vector fields Xi, . . . ,X m whose values at each point 
q G U form an orthonormal basis of D(q) for the quadratic form g^; the 
metric gsn associated with (D, g R ) then coincides with the metric g associated 
with Xi, . . . , X m . Thus, locally, there is a one-to-one correspondence between 
sub-Riemannian structures and nonholonomic systems for which the rank of 
A(g) = span {Xi(q), . . . , X m (q)} is constant. 

However this correspondence does not hold globally since, for topological 
reasons, a distribution of rank m may not always be generated by m vector 
fields on the whole M. Conversely, the vector fields X\, . . . , X m of a nonholo- 
nomic system do not always generate a linear space A(q) of constant rank 
equal to m. It may even be impossible, again for topological reasons (for 
instance, on an even dimensional sphere). 
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A way to conciliate both notions is to generalize the definition of sub- 
Riemannian structure. 

Definition 1.4. A generalized sub-Riemannian structure on M is a triple 
(E, a, g R ) where 

• E is a vector bundle over M; 

• a : E — > TM is a morphism of vector bundles; 

• gn is a Riemannian metric on E. 

With a generalized sub-Riemannian structure a metric is associated which 
is defined by 

9sn(q,v) = M{g(q,u) : u G E(q), a{u) = v}, for q G M, v G T g M. 

The generalized sub-Riemannian distance dsn on M is then defined from this 
metric g$R as <i is defined from the metric g. 

This definition of sub-Riemannian distance actually contains the two no- 
tions of distance we have introduced before. 

• Take E = M x M m , a : E -> TM, <x(g, it) = £Xi and the 
Euclidean metric on IR m . The resulting generalized sub-Riemannian 
distance is the distance associated with the nonholonomic system (El) . 

• Take E = D, where D is a distribution on M, a : D TM the 
inclusion, and g# a Riemannian metric on T>. We recover the distance 
associated with the sub-Riemannian structure (D,gj{). 

Locally, a generalized sub-Riemannian structure can always be defined 
by a single finite family Xi, . . . ,X m of vector fields, and so by a nonholo- 
nomic system (without rank condition). It actually appears that this is also 
true globally (see [ABB12J, or |DLPR12] for the fact that a submodule of 
TM is finitely generated): any generalized sub-Riemannian distance may be 
associated with a nonholonomic system. 

In these notes, we will always consider a sub-Riemannian distance d as- 
sociated with a nonholonomic system. However, as we just noticed, all the 
results actually hold for a generalized sub-Riemannian distance. 



9 



2 Controllability 



Consider a nonholonomic system 



m 



q = ^UiXiiq) 



i=i 



on a smooth n-dimensional manifold M. This section is concerned with the 
question of controllability: is the attainable set A p from any point p equal 
to the whole manifold Ml We will see next the implications on the sub- 
Riemannian distance d and on the topology of the metric space (M, d). 

2.1 The Chow-Rashevsky Theorem 

The controllability of (EJ) is mainly characterized by the properties of the Lie 
algebra generated by Xi, . . . , X m . We first introduce notions and definitions 
on this subject. 

Let VF(M) denote the set of smooth vector fields on M. We define A 1 
to be the linear subspace of VF(M) generated by Xi, . . . , X m , 



For s > 1, define A s+1 = A s + [A 1 , A'], where we have set [A\A S ] = 
span{[X, Y] : X G A 1 , Y G A s }. The Lie algebra generated by Xi, . . . , X m 
is defined to be Lie(Xi, . . . , X m ) = |J S>1 A S . Due to the Jacobi identity, 
Lie(X\, . . . ,X m ) is the smallest linear subspace of VF(M) which both con- 
tains X±, . . . , X m and is invariant by Lie brackets. 

Let us denote by / = i\ • • • iy. a multi-index of {1, ... , m}, and by |/| = k 
the length of /. We set 



With these notations, A s = span{X/ : |/| < s}. 

For q G M, we set Lie{X x , . . . ,X m )(q) = {X(q) : X G Lie(X u . . .,X m )}, 
and, for s > 1, A s (g) = {X(q) : X G A s }. By definition these sets are 
linear subspaces of T q M. 

Definition 2.1. We say that ([S]) (or the vector fields Xi, . . . ,X m ) satisfies 
Chow's Condition if 



A 1 



span{Xi, . . . ,X m }. 



[X h , [. . . , [X ik _ 1} X ik ] ...]. 



Lie(Xi, . . . ,X m )(q) = T q M, 



Vq G M. 
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Equivalently, for any q 6 M, there exists an integer r = r(q) such that 
dim A r (q) = n. 

This property is also known as the Lie algebra rank condition (LARC), 
and as the Hormander condition (in the context of PDE). 



Lemma 2.1. If (JSJ) satisfies Chow's Condition, then for every p e M, the 
is a neighbourhood of p. 

Proof. We work in a small neighbourhood U C M of p that we identify with 
a neighbourhood of in M n . 

Let (f)\ = exp(iJQ) be the flow of the vector field Aj, i — 1, . . . , m. Every 
curve 1 1 — y 4>l(q) is a trajectory of and we have 

4>\ = id + tX l + o(t). 

For every multi-index I of {1, . . . , m}, we define the local diffeomorphisms 
<f>\ on [/ by induction on the length |J| of J: if J = iJ, then 

By construction, is the endpoint of a trajectory of (JSJ) issued from g. 

Moreover, on a neighbourhood of p there holds 

= id + t |/| X / + o(t |/| ). (6) 

We postpone the proof of this formula to the Appendix (Proposition IA.4| ). 

To obtain a diffeomorphism whose derivative with respect to the time is 
exactly Xi, we set 

i>li m if * > o, 

ip* = ^ 0i| t |i/|j| if t < and |/| is odd, 

, 0j t |i/|/|] if t < and |/| is even, 



where I = iJ. Thus 



^ = id + tA 7 + o(t), (7) 



and ipj. (q) is the endpoint of a trajectory of ([E]) issued from g. 

Let us choose now commutators Xi x , . . . , X/ n whose values at p span 
T p M. This is possible thanks to Chow's Condition. We introduce the map 
ip defined on a small neighbourhood Q of in M n by 
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We conclude from ([7]) that this map is C l near and has an invertible 
derivative at 0, which implies that it is a local C7 1 -diffeomorphism. Therefore 
(p(£l) contains a neighbourhood of p. 

Now, for every t 6 fi, ip(t) is the endpoint of a concatenation of trajec- 
tories of ([£]), the first one being issued from p. It is then the endpoint of a 
trajectory starting from p. Therefore <p(Q) C A p , which implies that A p is a 
neighbourhood of p. □ 



Theorem 2.2 (Chow-Rashevsky's theorem). If M is connected and if (JS]) 
satisfies Chow's Condition, then any two points of M can be joined by a 
trajectory of . 

Proof. Let p G M. If q G A p , then p G A q . As a consequence, A p = A q 
for any q G M and the lemma above implies that A p is an open set. Hence 
the manifold M is covered by the union of the sets A p that are pairwise 
disjointed. Since M is connected, there is only one such open set. □ 

Remark 2.1. This theorem appears also as a consequence of the Orbit Theo- 
rem (Sussmann, Stefan |Ste74t IS"us73j ): each set A p is a connected immersed 
submanifold of M and, at each point q G A p , Lie(Xi, . . . ,X m )(q) C T q A p . 
Moreover, when the rank of the Lie algebra is constant on M, both spaces 
are equal, i.e. Lie(Xi, . . . , X m )(q) = T q A p . 

Thus, when the Lie algebra generated by X±, . . . , X m has constant rank, 
Chow's Condition is not restrictive: it is indeed satisfied on each A p by the 
restriction of the vector fields X%, . . . , X m to the manifold A p . 

Remark 2.2. The converse of Chow's theorem is false in general. Consider for 
instance the nonholonomic system in IR 3 defined by X\ = d x , X2 = d y +f(x)d z 
where f(x) = e~ x l x for positive x and f(x) = otherwise. The associated 
sub-Riemannian distance is finite whereas X 1; . . . , X m do not satisfy Chow's 
Condition. 

However, for an analytic nonholonomic system (i.e. when M and the vector 
fields Xi, . . . , X m are in the analytic category), Chow's Condition is equiva- 
lent to the controllability of (JS]) (see Nag66. ISus74j ). 



Remark 2.3. Our proof of Theorem 12.21 also shows that, under the assump- 
tions of the theorem, for every point p G M the set 

{exp{t h X h ) o • • ■ o exp(t ik X ik ){p) : k G N, % G R, i j G {1, . . . ,m}} 

is equal to the whole M. This set is often called the orbit at p of the vector 
fields Xi, . . . , X m . 
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2.2 Topological structure of (M,d) 



The proof of Lemma 12.11 gives a little bit more than the openness of A v . 
For e small enough, any 4>\{q), < t < e, is a trajectory of length e. Thus 
(f(ti, . . . ,t n ) is the endpoint of a trajectory of length less than N^ti] 1 ^ 1 ^ + 
■ ' ' + l^nl 1 ^' 1 "')) where N counts the maximal number of concatenations in- 
volved in the ip^s. This gives an upper bound for the distance, 

d(p,<p(t)) <N(\t 1 \ 1 ^ + --- + \t n \ 1 ^). (8) 

This kind of estimates of the distance in terms of local coordinates plays 
an important role in sub-Riemannian geometry, as we will see in Section I3~41 
However here (t\, . . . ,t n ) are not smooth local coordinates, as ip is only a 
C 1 -diffeomorphism, not a smooth diffeomorphism. 

Let us try to replace (ti,...,t n ) by smooth local coordinates. Choose 
local coordinates (yi, . . . ,y n ) centered at p such that -§^\ p = Xj.(p). The 
map (p y = y o p is a C 1 -diffeomorphism between neighbourhoods of in R n , 
and its differential at is dip y = Id®.™- 

Denoting by || • ||rt> the Euclidean norm on IR n , we obtain, for \\t\\^n small 
enough, y^t) = ti + o(\\t\\^n). The inequality jSJ) becomes 

d(p,qy)<N'\\y\\ll:, 

where q y denotes the point of coordinates y, and r = maxj |Jj|. This inequal- 
ity allows to compare d to a Riemannian distance. 

Let gR be a Riemannian metric on M, and da the associated Riemannian 
distance. On a compact neighbourhood of p, there exists a constant c > 
such that g(Xi, Xi)(q) < c _1 , which implies cd R (p, q) < d(p, q). Moreover we 
have dji(p,q y ) > Cst\\y\\ Rn . We have then obtained a first estimate to the 
sub-Riemannian distance. 

Theorem 2.3. Assume flSJ) satisfies Chow's Condition. For any Rieman- 
nian metric g R , we have, for q close enough to p, 

cd R (p,q) < d(p,q) < Cd R (p,q) 1/r , 

where c, C are positive constants and r is an integer such that A r = T V M . 

Remark 2.4. If we choose for g R a Riemannian metric which is compatible 
with g, that is, which satisfies ^_r|a = g, then by construction d R (p,q) < 
d(p,q). 

Corollary 2.4. Under the hypotheses of Theorem \2.2\ d is a distance func- 
tion on M, i.e., 
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(i) d is a function from M x M to [0, oo); 

(ii) d(p,q) = d(q,p) (symmetry); 
(Hi) d(p, q) = if and only if p = q; 

(iv) d(p,q) + d(p,q') < d(p,q') (triangle inequality). 

Proof. By Chow-Rashevsky's theorem (Theorem \2.2\i . the distance between 
any pair of points is finite, which gives (i). The symmetry of the distance 
results from the fact that, if 7(s), s G [0, T], is a trajectory joinings to q, then 
s 1 — y j(T — s) is a trajectory of same length joining q to p. Point (Hi) follows 
directly from Theorem 12.31 Finally, the triangle inequality is a consequence 
of the following remark. If 7(5), s G [0, T], is a trajectory joining p to q and 
Y(s), s G [0, T"], is a trajectory joining q to q', then the concatenation 7 * 7', 
defined by 

7 7lSj \ y(s-T) if se [T, T + T'] , 
is a trajectory joining p to q' whose length satisfies 

length(7 * 7') = length(7) + length^'). 

□ 

A second consequence of Theorem l2.3l is that the sub-Riemannian distance 
d is 1/ r- Holder with respect to any Riemannian distance, and so continuous. 

Corollary 2.5. If (JSj) satisfies Chow's Condition, then the topology of the 
metric space (M, d) coincides with the topology of M as a smooth manifold. 

3 First-order approximations 

Consider a nonholonomic system (}S]) : q = Y^iL\ u iXi{q) on a manifold M 
satisfying Chow's Condition, and denote by d the induced sub-Riemannian 
distance. As we have seen in Section 11.11 the infinitesimal behaviour of 
this system should be captured by an approximation to the first-order with 
respect to d. In this section we will then provide notion of first-order ap- 
proximation and construct the basis of an infinitesimal calculus adapted to 
nonholonomic systems. To this aim, a fundamental role will be played by 
the concept of noholonomic order of a function at a point. We will then see 
that approximations to the first-order appear as nilpotent approximations, 
in the sense that X\ , . . . , X m are approximated by vector fields that generate 
a nilpotent Lie algebra. 
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The whole section is concerned with local objects. Henceforth, through- 
out the section we fix a point p G M and an open neighbourhood U of p that 
we identify with a neighbourhood of in W l through some local coordinates. 

3.1 Nonholonomic orders 

Definition 3.1. Let / : M — > R be a continuous function. The nonholo- 
nomic order of f at p, denoted by ord p (/), is the real number defined by 

ord p (/) = sup{ S eK : f(q) = 0{d(p, q) s ) } . 

This order is always nonnegative. Moreover ord p (/) = if f(p) ^ 0, and 
ord p (/) = +oo if f(p) = 0. 

Example 3.1 (Euclidean case). When M = M. n , m = n, and X« = d Xi , the 
sub-Riemannian distance is simply the Euclidean distance on IR n . In this 
case, nonholonomic orders coincide with the standard ones. Namely, ord (/) 
is the smallest degree of monomials having nonzero coefficient in the Taylor 
series 

of / at 0. We will see below that there exists in general an analogous char- 
acterization of nonholonomic orders. 

Let C°°(p) denote the set of germs of smooth functions at p. For / e 
C°°(p), we call nonholonomic derivatives of order 1 of f the Lie derivatives 
Xif, . . . , X m f. We call further X^Xjf), Xi(Xj(X k f)),. . . the nonholonomic 
derivatives of f of order 2, 3, . . . The nonholonomic derivative of order of 
/ at p is f(p). 

Proposition 3.1. Let f G C°°(p). Then ord p (/) is equal to the biggest 
integer k such that all nonholonomic derivatives of f of order smaller than 
k vanish at p. Moreover, 

f(q)=0{d(p,qr d ^). 
Proof. The proposition results from the following two assertions: 

(i) if £ is an integer such that £ < ord p (/), then all nonholonomic deriva- 
tives of / of order < £ vanish at p; 

(ii) if £ is an integer such that all nonholonomic derivatives of / of order 
< £ vanish at p, then f(q) = 0(d(p,q) e+1 ). 



15 



Let us first prove point (i). Let £ be an integer such that £ < ord p (/). We 
write a nonholonomic derivative of / of order k < £ as 

(X n . . . X lfc /)(p) = ^ — f(eMt k X ik ) o.-.o eMtiXiiM) [ =Q - 

The point g = exp(tkX ik ) o ■ ■ ■ o exp(tiX il )(p) is the endpoint of a trajectory 

of length \ti\ + - — h \t n \. Therefore, d(p, q) < \t\\ + h \t n \. 

Since k < £ < ord p (/), there exists a real number s > such that 
f(q) = 0((|ti| + ■ • • + |t„|) fe+s ). This implies that 



= 0. 

t=o 



Thus point (%) is proved. 

The proof of point (ii) goes by induction on £. For £ = 0, assume that 
all nonholonomic derivatives of / of order < vanish at p, that is f(p) = 0. 
Choose any Riemannian metric on M and denote by da the associated Rie- 
mannian distance on M. Since / is smooth, there holds f(q) < Cst dn{p, q) 
near p. By Theorem 12. 3\ this implies /(g) < Cst d(p, q), and so property (ii) 
for £ = 0. 

Assume that, for a given £ > 0, (ii) holds for any function / (induction 
hypothesis) and take a function / such that all its nonholonomic derivatives 
of order < £ + 1 vanish at p. 

Observe that, for i — 1, . . . , to, all the nonholonomic derivatives of X^f of 
order < £ vanish at p. Indeed, . . . X ik (Xif) = X it . . . X ik X,if . Applying 
the induction hypothesis to Xif leads to Xif(q) = 0(d(p, q) e ) . In other 
words, there exist positive constants Ci, . . . , C m such that, for q close enough 
to p, 

xj( q )<cAp,qY- 



Fix now a point q near p. By Remark 1 1. 2 [ there exists a minimizing curve 
7(-) of velocity one joining p to q. Therefore 7 satisfies 

m 

7(f) = 5>,(t)X t ( 7 (f)) for a.e. t G [0,T], 7 (0) = p, 7 (T) = g, 

8=1 

with ^ji^(t) = 1 a.e. and d(p, 7(f)) = t for any £ e [0,T]. In particular 
d(p, q) = T. 
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To estimate f(q) = f(y(T)j, we compute the derivative of fyy(t)) with 
respect to t, 

m 

i=i 

m 

Y,\<t)\Cid{p,i{t)) 1 <Ct e , 
i=i 

where C = C\ + • — \- C m . Integrating this inequality between and t gives 

\fWt))\<\f(p)\ + £i* +1 - 

Note that f(p) = 0, since the nonholonomic derivative of / of order at p 
vanishes. Finally, at t = T = d(p, q), we obtain 

which concludes the proof of (it). □ 

As a consequence, the nonholonomic order of a smooth (germ of) function 
is given by the formula 

ordp(/) =min{s G N : 3 i u . . . , i s E {1, . . . , m} s.t. (X it . . . X is f) (p) ^ 0}, 

where as usual we adopt the convention that min0 = +oo. 

It is clear now that any function in C°°(p) vanishing at p is of order > 1. 
Moreover, the following basic computation rules are satisfied: for every /, g 
in C°°{p) and every A £ R \ {0}, 

ord p (fg) > oid p (f) + oid p (g), 
ord p (Xf) = ord p (/), 
ord p (/ + g) > min (ordp(/),ord p (5f)). 

Note that the first inequality is actually an equality However the proof of 
this fact requires an additional result (see Proposition 13. 2p . 

The notion of nonholonomic order extends to vector fields. Let VF(p) 
denote the set of germs of smooth vector fields at p. 

Definition 3.2. Let X 6 VF(p). The nonholonomic order of X at p, de- 
noted by ord p (X), is the real number defined by: 

ord p (X) = sup {a G R : ord p (X/) > a + ord p (/), V/ G C°°(p)} . 

The order of a differential operator is defined in the same way. 



dt 



f{l(t)) 



< 
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Note that ord p (X) G Z since the order of a smooth function is an integer. 
Moreover the null vector field X = has infinite order, ord p (0) = +00. 

Since the order of a function coincides with its order as a differential 
operator acting by multiplication, we have the following properties. For 
every X, Y G VF(p) and every / G C°°(p), 

ovd p ([X,Y]) > ord p (X)+ord p (r), 

ord p (/X) > ord p (/)+ord p (X), 

ord p (X) < ord p (X/)-ord p (/), W 

ord p (X + F) > min (ord p (X),ord p (F)). 

As already noticed for functions, the second inequality is in fact an equality. 
This is not the case for the first inequality (take for instance X = Y). 

As a consequence of (JSJ), X±, . . . , X m are of order > — 1, [Xj, Xj] of order 
> — 2, and more generally, every X in the set A k is of order > — k. 

Example 3.2 (Euclidean case). In the Euclidean case (see example 13.11) . the 
nonholonomic order of a constant differential operator is the negative of its 
usual order. For instance d Xi is of nonholonomic order —1. Actually, in this 
case, every vector field that does not vanish at p is of nonholonomic order 
-1. 

Example 3.3 (Heisenberg case). Consider the following vector fields on IR 3 : 

X 1 = d x - -d z and X 2 = d y + ^d z . 
2 2 

The coordinate functions x and y have order 1 at 0, whereas z has order 2 at 
0, since X x x(0) = X 2 y(0) = 1, Xiz(0) = X 2 z(0) = 0, and XiX 2 ;z(0) = 1/2. 
These relations also imply ordc^A^) = ord (A 2 ) = —1. Finally, the Lie 
bracket [X 1 ,X 2 ] = d z is of order —2 at since [Xi, X 2 ]z = 1. 

We are now in a position to give a meaning to first-order approximation. 

Definition 3.3. A family of m vector fields X\, . . . } X m defined near p is 
called a first- order approximation of X\, . . . , X m at p if the vector fields Xj — 
Xj, i — 1, . . . , m, are of order > at p. 

A consequence of this definition is that the order at p defined by the vector 
fields Xi, . . . , X m coincides with the one defined by X±, . . . , X m . Hence for 
any / G C°° (p) of order greater than s — 1 , 

(X n . . . X is f)(q) = (X n . . . XJ){q) + O {d(p, ff )«W/>-+i) . 

To go further in the characterization of orders and approximations, we 
need suitable systems of coordinates. 
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3.2 Privileged coordinates 



We have introduced in Section 12.11 the sets of vector fields A s , defined by 
A s = span{X/ : |/| < s}. Since X±, . . . ,X m satisfy Chow's Condition, the 
values of these sets at p form a flag of subspaces of T P M, that is, 

A\p) c A 2 (p) c • • • C A r -\p) £ A r (p) = T P M, (10) 

where r = r(p) is called the degree of nonholonomy at p. 

Set rii(p) = dim A l (p). The r-tuple of integers (ni(p), . . . ,n r (p)) is called 
the growth vector at p. The first integer n\ (p) < m is the rank of the family 
Xi(p), . . . , X m (p), and the last one n r {p) = n is the dimension of the manifold 
M. 

Let s > 1. By abuse of notations, we continue to write A s for the map 
q I—)- A s (q). This map A s is a distribution if and only if n s (q) is constant on 
M. We then distinguish two kind of points. 

Definition 3.4. The point p is a regular point if the growth vector is constant 
in a neighbourhood of p. Otherwise, p is a singular point. 

Thus, near a regular point, all maps A s are locally distributions. 

The structure of flag (flQj) may also be described by another sequence of 
integers. We define the weights at p, Wi = Wi(p), i = 1, . . . ,n, by setting 
Wj = s if n s _i(p) < j < n s (p), where n = 0. In other words, we have 

Wl = ■ ■ ■ = W ni = 1, W ni+ i = ■ ■ ■ = W n2 = 2, . . . , 

+ 1 = ■ ■ ■ = w nr = r. 

The weights at p form an increasing sequence Wi(p) < ■ ■ ■ < w n (p) which is 
constant near p if and only if p is a regular point. 

Example 3.4 (Heisenberg case). The Heisenberg case in R 3 given in exam- 
ple [3]3] has a growth vector which is equal to (2, 3) at every point. Therefore 
all points of M. 3 are regular. The weights at any point are W\ = W2 = 1, 
w 3 = 2. 

Example 3.5 (Martinet case). Consider the following vector fields on M 3 , 

x 2 

Xi = d x and X 2 = d y + — d z . 

2 

The only nonzero brackets are 

X 12 = [X 1 ,X 2 ] = xd z and X 112 = [X u [X x , X 2 \] = d z . 
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Thus the growth vector is equal to (2, 2, 3) on the plane {x = 0}, and to (2, 3) 
elsewhere. As a consequence, the set of singular points is the plane {x = 0}. 
The weights are w± — W2 = 1, u>3 = 2 at regular points, and Wx = W2 = 1, 
u>3 = 3 at singular ones. 

Example 3.6. Consider the vector fields on R 3 

Xl = d x and X 2 = d y + f(x)d z , 

where / is a smooth function on R which admits every positive integer nGN 
as a zero with multiplicity n (such a function exists and can even be chosen 
in the analytic class thanks to the Weierstrass factorization theorem |Rud70l 
Th. 15.9]). Every point (n,y, z) is singular and the weights at this point are 
Wi — W2 — 1, W3 — n + 1. As a consequence the degree of nonholonomy u> 3 
is unbounded on R 3 . 

Let us give some basic properties of the growth vector and of the weights. 

• At a regular point, the growth vector is a strictly increasing sequence: 
n iip) < ••' < n r (p). Indeed, if n s (q) = n s+ i(q) in a neighbourhood 
of p, then A s is locally an involutive distribution and so s = r. As 
a consequence, at a regular point p, the jump between two successive 
weights is never greater than 1, w i+ \ — Wi < 1, and there holds r(p) < 
n — m+1. 

• For every s, the map q t— > n s (q) is a lower semi-continuous function 
from M to N. Therefore, the set of regular points is open and dense in 
M. 

• For every i = l,...,n, the weight is an upper semi-continuous 
function. In particular, this is the case for the degree of nonholonomy 
r(-) = w n (-), that is, r(q) < r(p) for q near p. As a consequence r(-) is 
bounded on any compact subset of M. 

• The degree of nonholonomy may be unbounded on M (see example 13.61 
above). Thus determining if a nonholonomic system is controllable is 
a non decidable problem: the computation of an infinite number of 
brackets may be needed to decide if Chow's Condition is satisfied. 

However, in the case of polynomial vector fields on R™ (relevant in 
practice), it can be shown that the degree of nonholonomy is bounded 
by a universal function of the degree k of the polynomials (see [Gab95l 
IGJR98 J): 

r(x) < 2 3 " 2 n 2n k 2n . 
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The meaning of the sequence of weights is best understood in terms of 
basis of T p M. Choose first vector fields Yx, . . . , Y ni in A 1 whose values at p 
form a basis of A 1 ^). Choose then vector fields Y ni +i, • • • , Y n2 in A 2 such 
that the values Yi(p), . . . , Y n2 (p) form a basis of A 2 (p). For each s, choose 
Yn s -i+i, ■ ■ ■ 5 ~Ki s m A s such that Yi(jp), . . . , Y ns (p) form a basis of A s (p). We 
obtain in this way a family of vector fields Yx, . . . , Y n such that 

f Yi(p), . . . , y„(p) is a basis of T p M, 

{Ye A w *, i = l,...,n. { } 

A family of n vector fields satisfying (TTTT) is called an adapted frame at p. 
The word "adapted" means here "adapted to the flag fTTUl) ". since the values 
at p of an adapted frame contain a basis Yi(p), . . . , Y ng (p) of each subspace 
A s (p) of the flag. By continuity, at a point q close enough to p, the values of 
Yi, . . . ,Y n still form a basis of T q M. However, if p is singular, this basis may 
not be adapted to the flag fTTUl) at q. 

Let us explain now the relation between weights and orders. We write 
first the tangent space as a direct sum, 

T p M = A 1 (p) © A 2 (p)/A 1 (p) © ■ ■ • © A r (p)/A r - 1 (p), 

where A s (p)/A (p) denotes a supplementary of A s ~ 1 (p) in A s (p), and 
take a local system of coordinates (yi, . . . , y n ). The dimension of each space 
A s (p)/A s_1 (p) is equal to n s — n 5 _i, and we can assume that, up to a re- 
ordering, we have dyj(A s (p)/A s ~ 1 (p)) ^ for n s _i < j < n s . 

Take an integer j such that < j < n\. From the assumption above, 
there holds dyj(A l (p)) ^ 0, and consequently there exists Xj such that 
dyj(Xi(p)) 7^ 0. Since dyj(Xi) = Xiyj is a first-order nonholonomic derivative 
of yj, we have ord p (yj) < 1 = Wj. 

Take now an integer j such that n s _i < j < n s for s > 1, that is, tOj = s. 
Since <i?/ :; (A s (p)/A s ~ 1 (p)) 7^ 0, there exists a vector field Y in A s such that 
dyj{Y{p)) = (Yyj){p) 7^ 0. By definition of A s , the Lie derivative Yyj is 
a linear combination of nonholonomic derivatives of yj of order not greater 
than s. One of them must be nonzero, and so oid p (yj) < s = Wj. 

Finally, any system of local coordinates (yx, . . . , y n ) satisfies ord p (y 3 ) < Wj 
up to a reordering (or Y^=i OT d p (yi) < Yji=i w i without reordering). The 
coordinates with the maximal possible order will play an important role. 

Definition 3.5. A system of privileged coordinates at p is a system of local 
coordinates (zx, ... , z n ) such that ord p (^) = Wj for j — 1, . . . , n. 

Notice that privileged coordinates (zx, ■ ■ ■ , z n ) satisfy 

dzi(A^(p))^0, dz l (A w >- 1 (p)) = 0, z = l,... ,n, (12) 
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or, equivalently, d Zi \ p belongs to A Wi (p) but not to A^ _1 (p). Local co- 
ordinates satisfying (fl2"|) are called linearly adapted coordinates ("adapted" 
because the differentials at p of the coordinates form a basis of T*M dual 
to the values of an adapted frame). Thus privileged coordinates are always 
linearly adapted coordinates. The converse is false, as shown in the example 
below. 

Example 3.7. Take X± = d x , X 2 — d y + (x 2 + y)d z in R 3 . The weights at are 
(1,1,3) and (x,y,z) are adapted at 0. But they are not privileged: indeed, 
the coordinate z is of order 2 at since (X 2 X 2 z)(0) = 1. 

Remark 3.1. As it is suggested by Kupka |Kup96| , one can define privileged 
functions at p to be the smooth functions / on U such that 

ordp(/) = min{s e N : df(A s (p)) ^ 0}. 

It results from the discussion above that some local coordinates (z\, . . . , z n ) 
are privileged at p if and only if each Zi is a privileged function at p. 

Let us now show how to compute orders using privileged coordinates. We 
fix a system of privileged coordinates (zi, . . . , z n ) at p. Given a sequence of 
integers a = («i, . . . ,a n ), we define the weighted degree of the monomial 
z a = z" 1 ■ ■ ■ z% n to be w(a) = w^ai + ■ — h w n a n and the weighted degree of 
the monomial vector field z a d Zj as w(a) — Wj. The weighted degrees allow to 
compute the orders of functions and vector fields in a purely algebraic way. 

Proposition 3.2. For a smooth function f with a Taylor expansion 

f(z)~Y,c a z a , 

a 

the order of f is the least weighted degree of monomials having a nonzero 
coefficient in the Taylor series. 

For a vector field X with a Taylor expansion 

X(z) ~ ^a a jZ a d Zj} 

<*,j 

the order of X is the least weighted degree of a monomial vector fields having 
a nonzero coefficient in the Taylor series. 

In other words, when using privileged coordinates, the notion of nonholo- 
nomic order amounts to the usual notion of vanishing order at some point, 
only assigning weights to the variables. 
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Proof. For i = 1, . . .,n, we have d Zi \ p £ A Wi (p). Then there exist n vector 
fields Y\ , . . . , Y n which form an adapted frame at p and such that Y\ (p) = 
d Zl \ p , . . . , Y n (p) = d Zn \ p . For every i, the vector field Yi is of order > — Wi at 
p since it belongs to A Wi . Moreover we have (Yj,Zi)(p) = 1 and ord p (zi) = Wi. 
Thus ordp(Fj) = — Wi. 

Take a sequence of integers a = («i, . . . ,ac n ). The monomial z a is of 
order > w(a) at p and the differential operator Y a = Y"" 1 • • -Y® n is of 
order > —w(a). Observing that (YiZj)(p) — if j ^ i, we easily see that 
(Y a z a )(p) = Qi / qJ ^ 0, whence ord p (z a ) = w(a). 

In the same way, we obtain that, if z a , z 13 are two different monomials and 
A, fi two nonzero real numbers, then ord p (\z a + ^z^) = min (u>(a), 
Thus the order of a series is the least weighted degree of monomials actually 
appearing in the series itself. This shows the statement on order of functions. 

As a consequence, for any smooth function /, the order at p of d Zi f is 
> ordp(/) — Wi. Since moreover d Zi Zi = 1, we obtain that ord p (d Zi ) is equal 
to —W{. The second part of the statement follows. □ 

A notion of homogeneity is also naturally associated with a system of 
privileged coordinates (z\, . . . , z n ) defined on U. We define first the one- 
parameter family of dilations 

St : (z 1 ,...,z n ) ^ (t^ Zl ,...,t w "z n ), t>0. 

Each dilation 5 t is a map from M n to M n . By abuse of notations, for q £ U 
and t small enough we write S t (q) instead of S t (z(q)), where z(q) are the 
coordinates of q. A dilation 5t acts also on functions and vector fields by pull- 
back: 5*f = foS t and 8*X is the vector field such that (5*X)(5* t f) = 5*(Xf). 

Definition 3.6. A function / is homogeneous of degree s if 5*f = t s f. A 
vector field X is homogeneous of degree a if 5^X = t°X. 

For a smooth function (resp. a smooth vector field), this is the same as 
being a finite sum of monomials (resp. monomial vector fields) of weighted 
degree s. As a consequence, if a function / is homogeneous of degree s, then 
it is of order s at p. 

A typical degree 1 homogeneous function is the so-called pseudo-norm at 
p, defined by: 

z^\\z\\ p = \z 1 \ 1/wi + --- + \z n \ 1 / w ". (13) 

When composed with the coordinates function, the pseudo-norm at p is a 
(non smooth) function of order 1, that is, 

\\z(q)\\ p = 0(d(p,q)). 

Actually, it results from Proposition 13.21 that the order of a function / £ 
C°°(p) is the least integer s such that f(q) = 0(||^(g)||*). 
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Examples of privileged coordinates. Of course all the results above on 
algebraic computation of orders hold only if privileged coordinates do exist. 
Two types of privileged coordinates are commonly used in the literature. 

a. Exponential coordinates. Choose an adapted frame Yi, . . . , Y n at 

p. The inverse of the local diffeomorphism 



defines a system of local privileged coordinates at p, called canonical coordi- 
nates of the first kind. These coordinates are mainly used in the context of 
hypoelliptic operator and for nilpotent Lie groups with right (or left) invari- 
ant sub-Riemannian structure. 

The inverse of the local diffeomorphism 



also defines privileged coordinates at p, called canonical coordinates of the 
second kind. They are easier to work with than the one of the first kind. For 
instance, in these coordinates, the vector field Y n read as d Zn . One can also 
exchange the order of the flows in the definition to obtain any of the Y^ as 
d z .. The fact that canonical coordinates of both first and second kind are 
privileged is proved in Section IB. 1L 

We leave it to the reader to verify that the diffeomorphism 

(zi, ...,z n )^ exp(z n Y n H h z s+1 Y s+1 ) o exp(z s Y s ) • • • o exp(z 1 Y 1 )(p) 

also induces privileged coordinates. As a matter of fact, any "mix" between 
first and second kind canonical coordinates defines privileged coordinates. 

b. Algebraic coordinates. There exist also effective constructions 
of privileged coordinates (the construction of exponential coordinates is not 
effective in general since it requires to integrate flows of vector fields). We 
present here Bellaiche's algorithm, but other constructions exist (see [Ste86l 



1. Choose an adapted frame Y±, . . . , Y n at p. 

2. Choose coordinates (yi, . . . ,y n ) centered at p such that d Vi \ p = Y^p). 

3. For j = 1, . . . ,n, set 



(zx, . . . , z n ) i-> exp(ziYi H h z n Y n )(p) 



(zi, ...,z n )^ exp(^ n F n ) o • • • o exp(ziYi)(p) 



Wj-l 




k=2 



24 



where, for k — 2, . . . , Wj — 1, 

h k (vi, vj-i) = £ Y " ■ ■ ■ Y ^ {vj - £ Mi/)) W ?T ' ' ' f^T' 

tu(a)<tt)j 

with |a| = «i + • • • + a n . 

The fact that coordinates (z%, . . . ,z n ) are privileged at p will be proved in 
Section MM 

Coordinates (yi, . . . ,y n ) are linearly adapted coordinates. They can be 
obtained from any original system of coordinates by an affine change. The 
privileged coordinates (zi, . . . , z n ) are then obtained from (yi, . . . , y n ) by an 
expression of the form 

zi = yi, 

Z2 = J/2+POl(j/l), 

z n = y n + V>ol(y 1 ,...,y n -i), 

where each pol is a polynomial function without constant nor linear terms. 
The inverse change of coordinates takes the same triangular form, which 
makes the use of these coordinates easy for computations. 

3.3 Nilpotent approximation 

Fix a system of privileged coordinates (z\, . . . , z n ) at p. Every vector field 
Xi is of order > — 1, hence it has, in z coordinates, a Taylor expansion 

x i{z) ~ y^ j a a ,jZ a d z ., 

a.,3 

where w(a) > Wj — 1 if a a j ^ 0. Grouping together the monomial vector 
fields of same weighted degree, we express Xi as a series 

x t = xt l) +xf )) + x^ + ... 

where X^ is a homogeneous vector field of degree s. 

Proposition 3.3. Set Xi = X^\ i — 1, . . . , m. The family of vector fields 
Xi, . . . , X m is a first-order approximation of X%, . . . , X m at p and generate 
a nilpotent Lie algebra of step r = w n . 
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Proof. The fact that the vector fields X\ , . . . , X m form a first-order approx- 
imation of Xi, . . . , X m results from their construction. 

Note further that any homogeneous vector field of degree smaller than 
— w n is zero, as it is easy to check in privileged coordinates. Moreover, if X 
and Y are homogeneous of degree respectively k and /, then the bracket [X, Y] 
is homogeneous of degree k + / because <5 t *[X, Y] = [S*X, S*Y] = t k+l [X, Y]. 

It follows that every iterated bracket of the vector fields Xi, . . . , X m of 
length k (i.e. containing k of these vector field) is homogeneous of degree —k 
and is zero if k > w n . □ 

Definition 3.7. The family (X 1; . . . ,X m ) is called the (homogeneous) nilpo- 
tent approximation of (X±, . . . , X m ) at p associated with the coordinates z. 

Example 3.8 (unicycle). Consider the vector fields on M 2 x S 1 defining the 
kinematic model of a unicycle (see example 11.11) . that is, X 1 = cos9d x + 
sm9d y , X 2 = do. We have [Xi,X 2 ] = sin^c^ — cos9d y , so the weights are 
(1,1,2) at every point. At p = 0, the coordinates (x, 9) have order 1 and 
y has order 2, consequently (x, 9, y) is a system of privileged coordinates at 
0. Taking the Taylor expansion of X x and X 2 in the latter coordinates, we 
obtain the homogeneous components: 

X{- 1} =d x + 9d y , X( 0) =0, X [ 1] = -^d x -^d yj ... 

and Xg ^ = X2 = do. Therefore the homogeneous nilpotent approximation 
of (Xi,X 2 ) at in coordinates (x,9,y) is 

Xi = d x + 9d y , X 2 = d e . 

We easily check that the Lie brackets of length 3 of these vectors are zero, 
that is, [X 1; [X 1; X 2 ]] = [X 2 , [X 1; X 2 ]] = 0, and so the Lie algebra Lie(X 1; X 2 ) 
is nilpotent of step 2. 

The homogeneous nilpotent approximation is not intrinsic to the frame 
(Xi, . . . ,X m ), since it depends on the chosen system of privileged coordi- 
nates. However, if Xi, . . . , X m and X[, . . . , X' m are the nilpotent approxi- 
mations associated with two different systems of coordinates, then their Lie 
algebras Lie(Xi, . . . , X m ) and Lie(X[, . . . , X^) are isomorphic. If moreover 
p is a regular point, then Lze(X 1; . . . , X m ) is isomorphic to the graded nilpo- 
tent Lie algebra 

Gr(A) p = A(p) © (AVA 1 )^) © • • • © (A r -7A r )(p). 
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Remark 3.2. The nilpotent approximation denotes in fact two different ob- 
jects. Each X-i can be seen as a vector field on M n or as the representation in 
z coordinates of the vector field z*Xi defined on a neighbourhood of p in M. 
This will cause no confusion since the nilpotent approximation is associated 
with a given system of privileged coordinates. 

It is worth to notice the particular form of the nilpotent approximation in 
privileged coordinates. Write X { = YTj=i fij( z )9 Zj - Since X { is homogeneous 
of degree —1 and d Zj of degree —Wj, the function is a homogeneous poly- 
nomial of weighted degree Wj — 1. In particular it can not involve variables 
of weight greater than Wj — 1, that is, 



The nonholonomic control system z = 'Y^ = iUiX i {z) associated with the 
nilpotent approximation is then polynomial and in a triangular form, 



Computing the trajectories of a system in such a form is rather easy: given 
the input function (ui(t), . . . , u m (t)), it is possible to compute the coordinates 
zj one after the other, only by integration. 

As vector fields on R n , X±, . . . ,X m generate a sub-Riemannian distance 
on R™ which is homogeneous with respect to the dilation 5 t . 

Lemma 3.4. 

(i) The family (X 1: . . . ,X m ) satisfies Chow's Condition on R n . 

(ii) The growth vector at of (X 1: ... : X m ) is equal to the one at p of 
(Xi, . . . , X rn ). 

Let d be the sub-Riemannian distance on M. n associated with (Xi, . . . ,X m ). 
(Hi) The distance d is homogeneous of degree 1, 



n 




rn 




i=l 



d(5 t x,5 t y) = td(x,y). 



(iv) There exists a constant C > such that, for all z G W 1 , 



1 



z\\ p < d(0,z) < C\\z 



C 
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where \\ ■ \\ p denotes the pseudo-norm at p (see (|13[) ). 

Proof. Through the coordinates z we identify the neighbourhood U of p in 
M with a neighbourhood of in R". 

For every iterated bracket Xj = [X ik , . . . , [Jfj 2 , JQJ] of the vector fields 
Xi, . . . ,X m , we setXj = [X ik , . . . ,[X i2 , XiJI, and for k > 1 we set A k = 
span{X/ : |/| < k}. As noticed in the proof of Proposition ^. 3[ a bracket Xj 
of length |/| = k is homogeneous of weighted degree —k, and by construction 
of the nilpotent approximation, there holds Xj = Xj+ terms of order > —k. 
Therefore, 

Xj(0) = X/(p) mod span{9 Zj | : Wj < k} — X^p) mod A k ~ 1 (p). 

As a consequence, for any integer k > 1, we have 

dimA fc (0) = dimA fe (p), (14) 

and property (m) follows. Moreover, if Xj x , . . . , Xj n form an adapted frame 
at p, then the family (X/^O), . . . , Xj n (0)) is of rank n, which implies that 
its determinant is nonzero. Since the determinant of Xj 1 , . . . , Xj n is an ho- 
mogeneous polynomial of weighted degree 0, it is nonzero everywhere, which 
implies (i). 

As for the property (in), consider the nonholonomic system defined by 
the nilpotent approximation, that is, z = YlT=i u iXi(z) . Observe that, if 7 
is a trajectory of this system, that is, if 

m 

%t) = Y,^x l {%t)), te[o,T], 

i=l 

then the dilated curve 5xj satisfies 

, m 

^A7W = ^A Mi X,(5 A 7(t)), te[0,T). 

i=l 

Thus <5a7 is a trajectory of the same system, with extremities (Sxj)(0) = 
^a(t(0)) an d (3\l)(T) = 5\(j(T)), and its length equals Alength(7). This 
proves the homogeneity of d. 

Finally, since (Xi, . . . , X m ) satisfies Chow's Condition, the distance d(0, •) 
is continuous on M™ (see Corollary 12. 5p . We can then choose a real number 
C > such that, on the compact set {||z|| = 1}, we have 1/(7 < d(0, z) < C. 
Both functions d(0, z) and ||z|| being homogeneous of degree 1, the inequality 
of Property (iv) follows. □ 
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3.4 Distance estimates 



As it is the case for Riemannian distances, in general it is impossible to com- 
pute analytically a sub-Riemannian distance (it would require to determine 
all minimizing curves). This is very important to obtain estimates of the 
distance, at least locally. In a Riemannian manifold (M,g), the situation is 
rather simple: in local coordinates x centered at a point p, the Riemannian 
distance cLr satisfies: 

d R (q,q') = \\x(q)-x(q')\\ gp + o(\\x(q)\\ gp + \\x(q')\\ gp ), 

where || • \\ g is the Euclidean norm induced by the value g p of the metric g at 
p. This formula has two consequences: first, it shows that the Riemannian 
distance behaves at the first-order as the Euclidean distance associated with 
|| • \\ g ; secondly, the norm || ■ || g gives explicit estimates of cIr near p, such as 

^\\x{q)\\ 9p < d R (p,q) < C\\x(q)\\ gp . 

In sub-Riemannian geometry, the two properties above hold, but do not 
depend on the same function: the first-order behaviour near p is characterized 
by the distance d p defined by a nilpotent approximation at p, whereas explicit 
local estimates of d(p, ■) are given by the pseudo-norm at p \\ ■ \\ p defined 
in ( JIB"]) . We first present the latter estimates, often referred to as the "Ball- 
Box Theorem" , and then the first-order expansion of d in Theorem 13.81 

Theorem 3.5. The following statement holds if and only if Z\, . . . , z n are 

privileged coordinates at p: 

there exist constants C p and e p > such that, if d(p, q z ) < e p , then 

^r\\z\\ p <d(p,q z )<C p \\z\\ p (15) 

(as previously, q z denotes the point near p with coordinates z and \\ ■ \\ p the 
pseudo-norm atp). 

Corollary 3.6 (Ball-Box Theorem). Expressed in a given system of privi- 
leged coordinates, the sub-Riemannian balls B(p,e) satisfy, fore < e p , 

Box(-^-£:) C B(p,s) C Box(G>:), 

where Box(e) = [s w \e Wl ] x ■ ■ • x [-e Wn ,e Wn ]. 



29 



Remark 3.3. The constants C p and e p depend on the base point p. Around a 
regular point p , it is possible to construct systems of privileged coordinates 
depending continuously on the base point p. In this case, the corresponding 
constants C p and e p depend continuously on p. This is no longer true at a 
singular point. In particular, if p is singular, the estimate (fl5|) does not hold 
uniformly near p : we can not choose the constants C p and e p independently 
on p near p . We will see in section 14.21 uniform versions of the Ball-Box 
Theorem. 

The Ball-Box Theorem is stated in different papers, often under the hy- 
pothesis that the point p is regular. To our knowledge, two valid proofs 
exist, the ones in [NSW85J and in [Bel96j. The result also appears without 
proof in |Gro96] and in [Ger84j . and with erroneous proofs in |Mit85] and 
in |Mon02| . 

We present here a proof adapted from the one of Bellai'che (our is much 
simpler because Bellai'che actually proves a more general result, namely (1201) ). 
Basically, the idea is to compare the distances d and d. The main step is 
Lemma 13.71 below, which is essential in other respects to explain the role of 
nilpotent approximations in control theory. 

Fix a point p G M, and a system of privileged coordinates at p. Through 
these coordinates we identify a neighbourhood of p in M with a neighbour- 
hood of in W 1 . As in the preceding subsection, we denote by X\, . . . , X m 
the homogeneous nilpotent approximation of Xi, . . . ,X m at p (associated 
with the given system of privileged coordinates) and by d the induced sub- 
Riemannian distance on M n . Recall also that r = w n denotes the degree of 
nonholonomy at p. 

Lemma 3.7. There exist constants C and e > such that, for any xq G M n 
and any t e IR + with r = max(\\xo\\ p ,t) < e, we have 

\\x(t)-x{t)\\ p < Crt 1/r , 

where x(-) and x(-) are trajectories of the nonholonomic systems defined re- 
spectively by X-l, . . . , X m and X\, . . . , X m , starting at the same point x , as- 
sociated with the same control function u(-), and satisfying \\u(t)\\ = 1 a.e. 

Proof. The first step is to prove that ||^(t)|| p and ||x(t)|| p < Cstr for small 
enough r, where Cst is a constant. Let us do it for x(t), the proof being 
exactly the same for x(t). 

The equation of the control system associated with X±, . . . , X m is 

m 

^3 = ^2 U i{fij( X ) + r ij( X ))' j = 1, ■ ■ ■ , ft, 
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where fij(x) + ry{x) is of order < Wj — 1 at 0. Thus, for j = 1, . . . , n and 
i = 1, . . . ,m, \fij(x)+Vij (x)\ < Cst \\xWp 3 1 when \\x\\ p is small enough. Note 
that, along the trajectory x(t), \\x\\ p is small when r is. Since = 1 

a.e., we get: 

\±j\< Cst\\x\\^-\ (16) 

To integrate this inequality, choose an integer iV such that all N/wj are 

even integers and set \\x\\n = ( Yl7=i \xi\ N ^ Wi Y^ N . The function ||x||jv is 
equivalent to in the norm sense, and it is differentiable except at the 
origin. Inequality (FIBl) implies 4||x||jv < Cst , and then, by integration, 

\\x{t)\\ N < Cstt+ \\x(0)\\ N < Cstr. 

The functions \\x\\n and being equivalent, we obtain, for a trajectory 
starting at Xq, \\x(t)\\ p < Cstr when r is small enough. 

The second step is to prove \xj(t)—Xj(t)\ < Cstr w H. The function Xj—Xj 
satisfies the differential equation 

m 

X 3 ~ X j = ^ ] Ui jfij( X ) ~~ fij( X ) + r ij( X ))i 
i=l 
rn 

= ^2 Ui ( ^2 ( x k ~ x k)Qijk(x,x) + r i:j (x)), 

i=l {k:w k <Wj} 

where Qijk(x, x) is a homogeneous polynomial of weighted degree Wj— — 1. 
For || x ||p and \(x\\ p small enough, we have 

Mx)| < Cst \\x\\^ and \Q ijk (x,x)\ < Cst (\\x\\ p + \\x\\ p ) w *- w >°- 1 . 

Using the inequalities of the first step, we obtain finally, for r small enough, 

\xj{t) - ij{t)\ < Cst Mt) ~ Xk(t)\T w i~ w *- 1 + Cstr w >. (17) 

{k:w k <Wj} 

This system of inequalities has a triangular form, hence it can be in- 
tegrated iteratively. For Wj = 1, the inequality is \xj(t) — Xj(t)\ < Cstr, 
and so \xj(t) — Xj(t)\ < Cstrt. By induction, let j > rii and assume 
\xk{t) — Xk(t)\ < Cstr Wk t for k < j. Inequality f[T7|) implies 

\ Xj (t) -Xj(t)\ < CstT w ^H+ Cstr w i < Cstr w ^ 

and so \xj(t) — Xj(t)\ < Cstr w H. 
Finally, 

\\x{t) - x{t) \\ p < Cst r(t 1/wi + ■■■ + t 1/Wn ) < Cst rt 1/r , 
which completes the proof of the lemma. □ 
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Proof of Theorem \3.5[ Observe first that, by definition of order, a system of 
coordinates z is privileged if and only if d(p,q z ) > Cst \\z\\ p . What remains 
to prove is that, if z are privileged coordinates, then d(p,q z ) < Cst \\z\\ p . 
We will show that, for \\x°\\ p small enough, 

d(0,x°) < 2d(0,x°), 

and so d(0,x°) < Cst \\x°\\ p by Lemma [3.41 This will prove Theorem 13.51 

Fix x° G lR n , ||a; ||p < e. Let xo(t), t G [0,To], be a minimizing curve for 
d, having velocity one, and joining x° to 0. According to Remark 11.21 such 
a curve exists, and there exists a control Uq(-) associated with x such that 
||iio(i)|| = 1 a.e. Moreover, T = d(0,x°). 

Let Xo(t), t G [0, T ], be the trajectory of the control system associated 
with Xi, . . . , X m starting at x° and defined by u (-). We have length (x (-)) < 
To. Set x 1 = Xq(T q ). By Lemma 13771 

II^IIp = lko(T Q ) -x (T )\\ p < CtT^ 1 ', 

where r = max(||a; ||p, T ). By Lemma I3"T4"1 T = d(0,x°) satisfies T > 
||a; || p /C", so t < C'T , and 

d^x 1 ) < C'\\x% < C"d(0,x°) 1+1/r \ 

with C" = C' 2 C. 

Choose now Xi(t), t G [0,Ti], a minimizing curve for d of velocity one 
joining x 1 to 0. There exists a control associated with x\ such that 

it) || = 1 a.e. Let Xi(t), t G [0, TJ, be the trajectory of the control 
system associated with X\, . . . ,X m starting at x 1 and defined by Set 
x 2 = Xi(Tx). As previously, we have length(xi(-)) = ^(O,^ 1 ) and d(0,x 2 ) < 
C"d{0,x l ) l+l ' r . 

Repeating this construction, we obtain a sequence of points x°, x 1 , x 2 , . . . 
such that d(0,x k+1 ) < C' d(0, x k ) 1+1 ^ r , and a sequence of trajectories 
joining x k to x k+l of length equal to d(0,x k ). 

Taking ||a; ||p small enough, we can assume C d(0, x ) 1 ^ < 1/2. We have 
then ^(O,^ 1 ) < d(0,x°)/2, d(0, x k ) < d(0, x°)/2 fc ,. . . Consequently, x k 
tends to as k — > +oo. Putting end to end the curves gives a trajectory 
joining x° to of length d(0,x°) + d(0, x 1 ) + • • • < 2d(0,x°). This implies 
d(Q,x°) < 2d(0,x°), and the proof is complete. □ 

Now, the distance d on M. n induces a distance d p on a neighbourhood of p 
in M by setting d p (q,q') = d(z(q), z(q J )). This distance gives the first-order 
term in the expansion of d(p, •). 
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Theorem 3.8. On a neighbourhood of p in M there holds 



dip, q) = dpip, q)(l + (dpip, q) 

Remark 3.4. By Theorem 13.51 and Lemma [3.4} when d(p,q) is small enough 
we get the estimate 

gdp(p, q) < d(p, q) < Cd p (p, q), (18) 

where C is some positive constant. Theorem 13.81 essentially states that this 
constant can be chosen arbitrarily close to 1. 

Proof. Fix 5 > 0. We have to prove that there exists e > such that, if 
dip, q) < e, then 

(1 - 8)d p (p, q) < dip, q)<(l + 8)d p ip, q). (19) 

Let q be a point in M. Setting x° = q in the proof of Theorem 13.51 
furnishes a trajectory joining q to which length is equal to J2T=o di®, % k ), 
the points x k being such that d(0,x k+1 ) < C"d(0, x k ) 1+1/r . 

From (TTBl) . there exists e > such that d(p, q) < e implies C d(0, x°) 1//r < 
6/(1 + 5). In this case the trajectory from q to is of length not greater than 
(1 + 8)d(0, x°) and we have 

d(p,q) < (1 + 5)d p (p,q). 

To prove the other inequality in fjTTJ]) . we use the same argument but 
reverse the role of d and d. Let Xo{t), t e [0,T ], be a minimizing curve for 
d of velocity one joining x° to 0, and let %(•) be a control associated with 
xq(-) such that ||w (t)|| = 1 a.e. We have T = d(0,x°). Let x (t), t G [0, T ], 
be the trajectory of the control system associated with X\, . . . ,X m starting 
at x° and defined by the control uq(-). In particular, length (xo(-)) < ^o- 

Set x 1 = x (T ). By Lemma [3771 

Ha; 1 ||p = ||s (To) - x {T )\\ p < CrT 1/r , 



where r = max(||x°|| p , T ). Theorem 13.51 implies r < C p T , and 



^(O,^ 1 ) < CpWx 1 ^ < C"d{0,x°) 1+1/r , 



with C" = ClC 



V 



Repeating this construction gives a trajectory of X\, . . . , X m joining q to 
p whose length is equal to d(0, x k ), where d(0, x k+1 ) < C"d(0, x k ) 1+1 ^ r . 
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For d(p,q) small enough, we have C"d(0, x ) 1 ^ < 5/(1 — 5) and the 
trajectory from q to is of length < 1/(1 — 5)d(0,x°), which leads to 



d P {p, q) < 



1 



d(p,q). 



(1-5) 



This completes the proof. 



□ 



3.5 Approximate motion planning 

Given a control system (£), the motion planning problem is to steer (E) 
from an initial point to a final point. For nonholonomic systems, the exact 
problem is in general unsolvable. However methods exist for a particular 
class of systems, namely for nilpotent (or nilpotentizable) systems. It is 
then of interest to devise approximate motion planning techniques based on 
nilpotent approximations. These techniques are Newton type methods, the 
nilpotent approximation playing the role of the usual linearization. 
Precisely, consider a nonholonomic control system 



and initial and final points a and b in M. n . Denote by Xi, . . . , X m a nilpotent 
approximation of X ± , . . . , X m at b. The /c-step of an approximate motion 
planning algorithm take the following form (x k denotes the state of the sys- 
tem, x° being the initial point a): 

1. compute a control u(t), t G [0, T], steering the control system associated 
with Xi, . . . , X m from x k to b; 

2. compute the trajectory x(-) of (£) with control u(-) starting from x k ; 

3. S et x k+l = x(T). 

Is this algorithm convergent or, at least, locally convergent? The answer 
to the latter question is positive under an extra hypothesis on the control 
given in point 2 of the algorithm, namely, 

(H) there exists a constant K such that, if x k and b are close enough, then 



rn 



x = 



^2uiXi(x), xeR n , 



i=i 
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Note that a control corresponding to a minimizing curve for d satisfies this 
condition. Other standards methods using Lie groups (like the one in [LS91j ) 
or based on the triangular form of the homogeneous nilpotent approximation 
also satisfy this hypothesis. 

The local convergence is then proved exactly in the same way than The- 
orem [X3 We normalize first the control, so that ||u(i)|| = 1 a.e. Then from 
Lemmas O and ES we have d(b,x k+1 ) < C"T 1+1 '\ and using hypothesis 
(H), we obtain 

d(b,x k+l ) < C"K 1+1/r d(b,x k ) l+l/r . 

If a is close enough to b, we have, at each step of the algorithm, d(b, x k+1 ) < 
d(b,Xk)/2, which proves the local convergence of the algorithm. In other 
words, for each point b e M, there exists a constant > such that, if 
d(a, b) < Sb, then the approximate motion planning algorithm steering the 
system from a to b converges. 

To obtain a globally convergent algorithm, a natural idea is to iterate the 
locally convergent one. This requires the construction of a finite sequence of 
intermediate goals b = a, 6 1( . . . , b]y = b such that d(b^i, &j) < e^. However 
the constant e& depends on b and, as already noticed for Theorem I3.5[ it is 
not possible to have a uniform nonzero constant near singular points. Thus 
this method may provide a globally convergent algorithm only when every 
point is regular. 

4 Tangent structure to Carnot-Caratheodory 
spaces 

Consider a manifold M endowed with a sub-Riemannian distance d on M. 
The so-defined metric space (M, d) is called a Carnot-Caratheodory space. 
The notion of first-order approximation introduced in the previous section 
has a metric interpretation and will allow us to describe the local structure 
of a Carnot-Caratheodory space. 

4.1 Metric tangent space 

In describing the tangent space to a manifold, we essentially look at smaller 
and smaller neighbourhoods of a given point, the manifold being fixed. Equiv- 
alently, we can look at a fixed neighbourhood, but expanding the manifold. 
As noticed by Gromov, this idea can be used to define a notion of tangent 
space for a general metric space. 
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If X is a metric space with distance d, we define AX, for A > 0, to be 
the metric space with same underlying set as X and distance Xd. A pointed 
metric space (X, x) is a metric space with a distinguished point x. 

Loosely speaking, a metric tangent space to the metric space X at x is a 
pointed metric space (C X X, y) such that 

(C x X,y)= lim (XX, x). 

A— s-+oo 

Of course, for this definition to make sense, we have to define the limit of 
pointed metric spaces. 

Let us first define the Gromov-Hausdorff distance between metric spaces. 
Recall that, in a metric space X, the Hausdorff distance H-dist(A, B) between 
two subsets A and B of X is the infimum of p > such that any point of A 
is within a distance p of B and any point of B is within a distance p of A. 
The Gromov-Hausdorff distance GH-dist(X, Y) between two metric spaces X 
and Y is the infimum of Hausdorff distances H-dist(i(X), j(Y)) over all metric 
spaces Z and all isometric embeddings i :X — > Z, j :Y — > Z. 

Thanks to Gromov-Hausdorff distance, one can define the notion of limit 
of a sequence of pointed metric spaces: (X n , x n ) converge to (X, x) if, for any 
positive r, 

GR-dist(B Xn (x n ,r),B x (x,r)) as n ->• +oo 

where B y (y, r) is considered as a metric space, endowed with the distance of 
Y . Note that all pointed metric spaces isometric to (X, x) are also limit of 
(X n ,x n ). However the limit is unique up to an isometry provided the closed 
balls around the distinguished point are compact |BBI01| Sect. 7.4]. 

Finally, one says that (X\,x\) converge to (X, x) when A — )■ oo if, for 
every sequence A n , (X\ n ,x\ n ) converge to (X, x). 

Definition 4.1. A pointed metric space (C x X,y) is a metric tangent space 
to the metric space X at a; if (AX, x) converge to (C x X,y) as A — > +oo. If it 
exists, it is unique up to an isometry provided the closed balls around x in 
(AX, x) are compact. 

For a Riemannian metric space (M, d^) induced by a Riemannian metric g 
on a manifold M, metric tangent spaces at a point p exist and are isometric to 
the Euclidean space (TpM,g p ), that is, the standard tangent space endowed 
with the scalar product defined by the quadratic form g p . 

For a Carnot-Caratheodory space (M, d), the metric tangent space is 
given by the nilpotent approximation. 
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Theorem 4.1. A Carnot-Caratheodory space (M,d) admits metric tangent 
spaces (C p M,y) at every point p £ M. The space C p M is itself a Carnot- 
Caratheodory space isometric to (M. n ,d), where d is the sub-Riemannian dis- 
tance associated with a homogeneous nilpotent approximation at p. 

This theorem, due to Bellaiche, is a consequence of a strong version of 
Theorem 13. 5t for q and q' in a neighbourhood of p, 

\d(q,q')-d(q,q')\ < Cstd(p,q)d(q,q') 1 / r . (20) 

In these notes, we present neither the proof of this result, nor the one of 
Theorem 14.11 and we refer the reader to |Bel96] . 

Remark 4.1. Recall that d is not intrinsic to the frame {X\, . . . ,X m ). Thus 
Theorem 14.11 does not provide an intrinsic characterization of the metric 
tangent space. Such characterizations exist for sub-Riemannian manifolds 
(M,D,gn) in [MM00] and [FJ03] , and the latter could easily be adapted 
to the case of a sub-Riemannian geometry associated with a nonholonomic 
system. However these characterizations are intrinsic to the differentiable 
manifold M equipped with the sub-Riemannian structure (D,g R ), or to M 
equipped with the frame (Xi, . . . ,X m ), not to the metric space (M,d). To 
our knowledge, the problem of finding a definition of the metric tangent space 
C p M depending only on the Carnot-Caratheodory space (M, d) is still open. 

The question we want to address now is: what is the algebraic structure 
of CpMl Of course C P M is not a linear space in general: for instance, d 
is homogeneous of degree 1 but with respect to dilations 5 t but not with 
respect to the usual Euclidean dilations. We will see that C p M has a natural 
structure of group, or at least of quotient of groups. 

Denote by G p the group generated by the diffeomorphisms exp(tAj) acting 
on the left on IR n . Since g p = Lie(Xi, . . . ,X m ) is a nilpotent Lie algebra, 
G p = exp(jjp) is a simply connected group, having g p as its Lie algebra. 

This Lie algebra q p splits into homogeneous components 

dp = © ■ • ■ © 0~ r , 

where g~ s is the set of homogeneous vector fields of degree — s, and so g p is a 
graded Lie algebra. The first component g _1 = span(Xi, . . . ,X m ) generates 
q p as a Lie algebra. All these properties imply that G p is what we call a 
Carnot group. 

Definition 4.2. A Carnot group is a simply connected Lie group, such that 
the associated Lie algebra is graded, nilpotent, and generated by its first 
component. 
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Note that the dilations 5 t act on g p as a multiplication by t s on g s . 
This action extends to G p by the exponential mapping. 

Example 4.1 (Heisenberg group). The simplest non Abelian Carnot group is 
the Heisenberg group H 3 which is the connected and simply connected Lie 
group whose Lie algebra satisfies 

= g _1 ©0" 2 , with dim g" 1 = 2. 

As a consequence, dimHI 3 = 3. Choosing a basis X, Y, Z — [X, Y] of g, we 
define coordinates on H 3 by the exponential mapping 

(x, y, z) (-)■ exp(xX + yY + zZ). 

By the Campbell-Hausdorff formula (see Section [A.ip . the law group on H 3 
in these coordinates is 

(x,y,z){x',y',z') = (x + x' ,y + y ', z + z' + hxy' - x'y)), 

which is homogeneous with respect to the dilation 5 t (x,y,z) = (tx,ty,t 2 z). 

Finally, denote by Xi,X2 the left- invariant vector fields on H 3 whose 
values at the identity are respectively X and Y. In coordinates (x,y, z), 
these vector fields write as 

X 1 = d x - 1 9* and X 2 = d y + |<9 a , 

which are the vector fields of what we have called the Heisenberg case in 
examples 13.31 and 13.41 

Let £i, . . . , £ m be the right-invariant vector fields on G p such that £j(id) = 
Xi, where id is the identity of G p . Equivalently, 

= ^[ ex p(^)^]L =0 - 

With . . . , is associated a right-invariant sub-Riemannian metric and 
a sub-Riemannian distance dc v on G p . 

The action of G p on MJ 1 is smooth and transitive. Indeed, for every x G M n , 
the orbit of x under the action of G p is the set 

{expfoXiJ o • • • o exp{U k X ik )(x) : k G N, % E M, ij E {1, . . . ,m}| . 

By Chow-Rashevsky's theorem (or more precisely from Remark 12.31) . this 
set is the whole M™ since {X\,...,X m ) satisfies Chow's Condition on W 1 
(Lemma 13.41) . 

To understand the algebraic structure of C p M we will use the following 
standard result on transitive action of Lie groups (see for instance [Lee03j 
Th. 9.24]). 
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Theorem 4.2. Let G be a Lie group acting on the left smoothly and tran- 
sitively on a manifold M. Let q £ M and H be the isotropy subgroup of q 
which is defined by H = {g £ G : g ■ q = q}. Then H is a closed subgroup 
of G, the left coset space G/H is a manifold of dimension dimG — dim if, 
and the map F : G/H — > M defined by F(gH) = g ■ q is an equivariant 
diffeomorphism. 

Let H p be the isotropy subgroup of £ W 1 under the action of G p . 
According to Theorem I4.2[ the map 4> p : G p — > M n , <p p {g) = g(0), induces a 
diffeomorphism 

ij p : G p /H p -» R n , ij p (gH p ) = g(0). 

Observe that H p is invariant under dilations, since 5 t g(5 t x) = 5 t (g(x)). 
Hence H p is connected and simply connected, and so H p = exp(fj p ), where 
\) p is the Lie sub-algebra of q p containing the vector fields vanishing at 0, 

t) P = {Zeg p : Z(0) = 0}. 

As q p , \) p is invariant under dilations and splits into homogeneous compo- 
nents. 

Now, the elements X%, . . . ,X m of g p act on the left on G p /H p with the 
notation £ 1; ...,C m , 

ZiigHp) = ^[exp(tXi)gH p ] \ f=Q . 

These vector fields define a sub-Riemannian metric and a sub-Riemannian 
distance d on G p /H p . We clearly have ^p*^ = X iy so ip p maps the sub- 
Riemannian metric on G p /H p associated with (Ci, ■ ■ ■ , £ m ) to the one on M. n 
associated with (Xi, . . . , X m ). 

Theorem 4.3. The metric tangent space C P M and (IR n , d) are isometric to 
the coset space G p /H p endowed with the sub-Riemannian distance d. 

Example 4.2 (Grusin plane). Consider the vector fields X\ = d x and X 2 = 
xd y on M 2 . The Carnot-Caratheodory space defined by these vector fields is 
called the Grusin plane. 

The only nonzero bracket is A 12 = d y . Thus, at p — 0, the weights are 
(1,2), and (x, y) are privileged coordinates. Since X\ and X 2 are homo- 
geneous with respect to this system of coordinates, we have X\ = X\ and 
X 2 = X 2 . The Lie algebra they generate is 

0o = span(Xi,X 2 ,Xi 2 ) 
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which is of dimension 3, and the group exp($j ) is actually the Heisenberg 
group H 3 (see example I4.ip . The Lie sub-algebra f)o of g containing the 
vector fields vanishing at is 

P)o = span(X 2 ), 

which is one- dimensional. Thus the Grusin plane is isometric to H 3 / exp(f)o) 
endowed with the distance d. 

Example 4.3 (Martinet case). Consider the Martinet case, defined on R 3 by 

Xi = d x and X 2 = d y + —d z . 

As noticed in example 13. 5[ at p = 0, the coordinates (x, y, z) are privileged 
and, by homogeneity, X\ = X\ and X 2 — X 2 . Moreover the only nonzero 
bracket are X 12 = xd z and X U2 = d z . Thus, 

0o = span(Xi,X 2 ,Xi2,Xii 2 ) 

which is of dimension 4. The group exp(g ) is called the Engel group, and 
is denoted by E 4 . The Lie sub- algebra f) of $j containing the vector fields 
vanishing at is 

f)o = span(Xi 2 ). 

When the point p is regular, this theorem can be refined thanks to the 
following result. 

Lemma 4.4. If p is a regular point, then dimG p = n. 

Proof. Let Xj ± , . . . , Xj n be an adapted frame at p. Due to the regularity of 
p, Xi 1 , . . . , Xi n is also an adapted frame near p, so any bracket Xj can be 
written as 

Xj(z) = «(*) X M> 

{i:\Ii\<\J\} 

where each function of order > |Jj| — \J\. Taking the homogeneous 

terms of degree —\J\ in this expression, we obtain 

Xj(z) = Yl ^Aw. 

{i-\Ii\=\J\) 

and so Xj e span(X/ 1 , . . . , Xj n ). Thus . . . , Xj n is a basis of g p , and so 
dim G p = n. □ 
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As a consequence H p is of dimension zero. Since H p is invariant under 
dilations, H p = {id}, and hence the mapping (f> p : G p — > M. n , <f> p (g) = <?(0), 
is a diffeomorphism. Moreover (pp^i = X{, which implies that (f) p maps the 
sub-Riemannian metric on G p associated with (£1, . . . , £ m ) to the one on W 1 
associated with (X 1? . . . , X m ). This gives the following resullEl. 

Proposition 4.5. When p is a regular point, the metric tangent space C P M 
and the Carnot-Caratheodory space (R™, d) are isometric to the Carnot group 
G p endowed with the right-invariant sub-Riemannian distance dc p - 

Thus Carnot groups have the same role in sub-Riemannian geometry as 
Euclidean spaces have in Riemannian geometry. For this reason they are 
sometimes referred to as "non Abelian linear spaces" : the internal operation 
- addition - is replaced by the law group and the external operation - multi- 
plication by a real number - by the dilations. Note that, when G p is Abelian 
(i.e. commutative) then G p has a linear structure and the sub-Riemannian 
metric on G p is a Euclidean metric. 

Example 4.4 (unicycle). In the case of the distance d associated with the 
unicycle (examples 11.11 and 13. 8p . the growth vector is (2,3) at every point. 
Hence every point p G M 2 x S 1 is regular, and the Lie algebra generated by 
the nilpotent approximation satisfies 

Q P = © 0~ 2 , with dung" 1 = 2. 

As a consequence, G p = H 3 (see example 14. ip . and so the metric tangent 
space to (IR 2 x S 1 ^) at every point p has the structure of the Heisenberg 
group. 

4.2 Desingularization and uniform distance estimate 

We have already highlighted in Remark 13.31 that singular points may cause 
difficulties, in particular because of the loss of uniformity of distance es- 
timates. Therefore it is necessary to study carefully the behaviour of the 
distance at such points. We proceed as it is usual for singularities, that is, 
we consider a singularity as the projection of a regular object. To this aim we 
exploit the algebraic structure of the metric tangent space, which provides a 
good way of lifting and projecting Carnot-Caratheodory spaces. 

Let us begin with nilpotent approximations. We keep the notations and 
definitions of the preceding subsection. At a singular point p, we have the 

*This result appeared first in |Mit85j . but with an erroneous proof. The presentation 
given here is inspired from the one of |Bel96j . 
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following diagram, 

(G p , d Gp ) 
(G p /H p ,d) ==> 

Since the sub-Riemannian metric on G p is a right-invariant, every point in 
the space (G p , dc p ) is regular. 

Definition 4.3. A Carnot-Caratheodory space (M, d) is said to be equireg- 
ular if every point in M is regular. 

Thus (M n ,<f) is the projection of the equiregular space (G p ,dG p )- Re- 
call now that £1, . . . , £ m (resp. £ l5 . . . , £ m ) are mapped to X 1; . . . , X m by </> p 
(resp. Vp)- Working in a system of coordinates, we identify G p /H p with M. n 
and £j with Xj. These coordinates on R" ~ G p /H p , denoted by x, induce 
coordinates (x,z) e R on G p for which we have 

N 

£i(x,z) = Xi(x) + b ij (x,z)d zr (21) 

j=n+l 

Let (x(-), be a trajectory of the nonholonomic system in G p defined 
by Ci) ■ ■ ■ ) Cm- Then, for every control «(•) associated with the trajectory, 



[x(t),z(t)) = ^2ui(t)£i(x,z), 

8=1 



It follows from (12T]) that x(-) is a trajectory in IR n of the system defined by 
£i, . . . , £ m , which is associated with the same controls u(-), so that 

lengthm-)) = length ((x, z) (•)). 
Thus d can be obtained from the sub-Riemannian distance dc p in G p by 
d(<?i, 92) = _ inf d G (qi, q 2 ), for any q x E q x E v , 

or, equivalently, B d {qi ) e) = <p p {B dG p (q Xl e)) . 

We will use this idea to desingularize the original space (M,d). Choose 
for x privileged coordinates at p, so that 

Xi(x) = Xi(x) + Ri(x) with ord p Ri > 0. 
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Set M = M x n , and in local coordinates (x, z) on M, define vector fields 
on a neighbourhood of (p, 0) by 

N 

£i(x, z) = Xi(x) + 2J hj{x,z)d Zj , 

j=n+l 

with the same functions 6^ as in ( 12 ip . Such vector fields are called a lifting 
of the vector fields Xx,...,X m : denoting by 7r : M — > M the canonical 
projection, we have Xi = 7T*£j for i = 1, . . . , m, that is, Xi is the projection 

We define in this way a nonholonomic system on an open set Z7 C M 
whose nilpotent approximation at (p, 0) is (£1, ... , £ m ), by construction. Un- 
fortunately, (p, 0) can be itself a singular point. Indeed, a point can be 
singular for a system and regular for the nilpotent approximation taken at 
this point. 

Example 4.5. Take the vector fields X\ = d Xl , X 2 = d X2 + X\d X3 + x\d Xi and 
X3 = d X5 + x\ 00 d Xi on IR 5 . The origin is a singular point. However the 
nilpotent approximation at is X± = X\, X2 = X2, X% = d X5 , for which is 
not singular. 

To avoid this difficulty, we take a group bigger than G p , namely the free 
nilpotent group A^ r of step r with m generators. A^ r is a Carnot group and its 
Lie algebra n r is the free nilpotent Lie algebra of step r with m generators. 
The given of m generators «i, . . . , a m of n r define on iV r a right-invariant 
sub-Riemannian distance d^- 

The group A^ r can be thought as a group of diffeomorphisms, and so it 
defines a left action on IR n . Denoting by J the isotropy subgroup of for 
this action, we obtain that (M n ,cf) is isometric to N r /J endowed with the 
restriction of the distance d^. 

Reasoning as above, we are able to lift locally the vector fields Xi, . . . , X m 
on M to vector fields on M x M n_n , n = dimiVV, having a±, . . . ,a m for 
nilpotent approximation at (p, 0). Moreover (p, 0) is a regular point for the 
associated nonholonomic system in M x M n_n since N r is free up to step r. 
We obtain in this way a result of desingularization. 

Lemma 4.6. Let p be a point in M, r the degree of nonholonomy at p, 
n = n(m, r) the dimension of the free Lie algebra of step r with m generators, 
and M = M x M n_n . Then there exist a neighbourhood U C M of (p, 0), a 
neighbourhood U C M of p with U x {0} C U , local coordinates (x, z) on U , 
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and smooth vector fields on U , 

N 

£(x, z) = Xi(x) + z ) d *P (22) 

j=n+l 

such that: 

• 61 > ■ ■ ■ j Cm satisfy Chow's Condition and have r for degree of nonholon- 
omy everywhere (so the Lie algebra they generate is free up to step 
r); 

• every q in U is regular; 

• denoting by ir : M — )■ M the canonical projection, and by d the sub- 
Riemannian distance defined by £1, . . . ,£ m on U , we have 7r*£j = X i} 
and for q G U and e > small enough, 

B(q,e)=n(B S ((q,0),e 

or, equivalently, 

d(q u q 2 ) = __ inf d((q u 0), q 2 ) . 

<?2 6tt x (c/2 ) 

Remark 4.2. The lemma still holds if we replace r by any integer greater 
than the degree of nonholonomy at p. 

Thus any Carnot-Caratheodory space (M, d) is locally the projection of 
an equiregular Carnot-Caratheodory space (M, d). This projection preserves 
the trajectories, the minimizers, and the distance. 

Example 4.6 (Martinet case). Consider the vector fields of the Martinet case 
(see example 13.51) , defined on R 3 by: 

X x = d x and X 2 = d y + —d z . 

2 

Let ir : M 4 — > IR 3 be the projection with respect to the last coordinates, 
7r(x, y, z, w) = (x,y,z). Then Xx and X 2 are the projections of the vector 
fields defining the Engel group E 4 (see example EO 

„2 



£i = d x and £ 2 = d y + —d z + xd u 



x 

that is 7T*Ci = Xi. Thus, for every pair of points qx, q 2 G M 3 , 
dMart(?i,92) = inf d E *((qi,0),(q 2 ,w)), 

where c?Mart and d^i are the sub-Riemannian distance in respectively the 
Martinet space and the Engel group. 



44 



Example 4.7 (Grusin plane). Consider the vector fields 

X\ = d x , X 2 = xd y , 

on R 2 , which define the Grusin plane (see example 14.21) . Let tt : R 3 — > R 2 be 
the projection with respect to the last coordinates, n(x,y,z) = (x,y). Then 
X\ = 7r*^i and X 2 = 7r*£ 2 , where 

£1 = d x and £ 2 = d x + xd z , 

are, up to a change of coordinates, the vector fields defining the Heisenberg 
case (see example 13.31) . 

Application: uniform Ball-Box theorem 

The key feature of regular points is uniformity: 

• uniformity of the flag (fTOi) ; 

• uniformity w.r.t. p of the convergence (A(M, d),p) — > C P M (as ex- 
plained by Bellaiche |Bel96i Sect. 8], this uniformity is responsible for 
the group structure of the metric tangent space); 

• uniformity of distance estimates (see Remark 13. 3p . 

In particular the last property is essential to compute Hausdorff dimensions 
(see Section 14.31) or to prove the global convergence of approximate motion 
planning algorithms. Recall what we mean by uniformity in this context: in 
a neighbourhood of a regular point po, we can construct privileged coordi- 
nates depending continuously on the base point p and such that the distance 
estimate (|T5i) holds with C p and e p independent of p. 

As already noticed, all these uniformity properties are lost at singular 
points. However, using the desingularization of a sub-Riemannian manifold, 
we are able to give a uniform version of distance estimates. 

Let Q C M be a compact set. We denote by r max the maximum of 
degrees of nonholonomy at points in Q. As noticed in Section I3.2[ r max is 
finite. We assume that M is an oriented manifold, so that the determinant 
det is well-defined (see |AM78l Def. 2.5.18]). 

Let X be the set of n-tuples X = (X^, . . . ,X Tn ) of brackets of length 
< ^max- Since r max is finite, X is a finite subset of Lie(Xi, . . . , X m ) n . 
Given q 6 Q and e > 0, we define a function f qi£ : X — > R by 

/ 9i£ (X) = | det (X h (q)e lhl , • • • , X In (q)e^) \ . 
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We say that X e X is an adapted frame at (q, e) if it achieves the maximum 
of / ?j£ on X. 

The values at g of an adapted frame at (q, e) clearly form a basis of T q M . 
Moreover, q being fixed, the adapted frames at (q, e) are adapted frames at 
q for e small enough. 

Theorem 4.7 (Uniform Ball-Box theorem). There exist positive constants 
K and e such that, for q e Q, e < e , and any adapted frame X at (q,e), 
there holds 

Box x (g, — e) c B(q,e) c Box x (g, a^), 

A 

where Boxx(<?,£) = {exp(xiX Il ) o • • • o exp(x n X/ )(q) : \xi\ < 1 < i < 
»}■ 

Of course, q being fixed, this estimate is equivalent to the one of the 
Ball-Box theorem for e smaller than some £\(q) > 0. However, the main 
difference is that here e does not depend on q, whereas in the Ball-Box 
theorem E\ = £i{q) which can be infinitely close to as q varies. 

Let vol be any Riemannian volume on the manifold M. As a direct 
consequence of the Uniform Ball-Box theorem, we have an estimate of the 
volume of a small sub-Riemannian ball. 

Corollary 4.8. There exist positive constants K and e such that, for all 
q G f2 and e < Sq, 

Imax/^X) < vo\(B(q,e)) < ^max/^X), 

the maximum o// 9i£ (X) = | det (X/^g)^ 71 ', . . . , X In (q)e^ In ^ | being taken over 
all families X = (X^, . . . , Xz n ) of brackets of length |7j| < r max . 

If moreover all points in Q are regular, then for all q E Q and e < Sq, 

<vo\(B(q,e))<KeQ, (23) 

where Q = Yll=i w i( ( l) does not depend on q. 



4.3 Hausdorff dimension 

Consider a metric space (M, d) and denote by diam S the diameter of a set 
S C M . Let k > be a real number. For every subset A C M, we define 
the k-dimensional Hausdorff measure 'H k of A as % k {A) = lim £ _> + 7i^(A), 
where 

{oo oo 
(diam^) fc : A C \JSi, Si closed set, diam^ < e \ , 
i=i i=i J 
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and the k- dimensional spherical Hausdorff measure S k of A as S k (A) = 
lim £ _ s>0 + S k (A), where 

{oo oo ^ 

(diamS , i ) fc : i C (J 5i, is a ball, diamSj < e > . 
i=i i=i ) 

In the Euclidean space MJ 1 , fc-dimensional Hausdorff measures are often 
defined as 2~ k a{k)% k and 2~ k a{k)S k , where a(k) is defined from the usual 
gamma function as a(k) = r(|) fc /T(| + 1). This normalization factor is 
necessary for the n-dimensional Hausdorff measure and the Lebesgue measure 
to coincide on IR n . 

For a given set A C M, l-L k (A) is a decreasing function of k, infinite when 
k is smaller than a certain value, and zero when k is greater than this value. 
We call Hausdorff dimension of A the real number 

dim n A = snp{k : H k {A) = oo} = mf{k : H k {A) = 0}. 

Note that 1-L k < S k < 2 k, H k , so the Hausdorff dimension can be defined 
equivalently from Hausdorff or spherical Hausdorff measures. 

There exist only few results on Hausdorff measures in sub-Riemannian 
geometry, except for specific cases A B B 1 11 IGJ12j . The most general result 
is the following one. 

Theorem 4.9. Let (M, d) be an equiregular Carnot-Caratheodory space and 
p a point in M. Then the Hausdorff dimension of a small enough ball B(p, r) 
is dim-H B(p, r) = Q, where 

n 

Q = ^Wiip) = J^i (dim A l {p) - dim A i_1 (p)) 

i=l i>l 

does not depend on p. Moreover T-L^(B(p,r)) is finite. 

Proof. Fix a Riemannian volume vol on B(p,r) (it is possible for a small 
enough r). It results from Corollary 14.81 that, for q G B(p,r) and e small 
enough, 

^e Q <vo\(B(q,e)) < Ke Q . (24) 
K 

Define N £ to be the maximal number of disjoints balls of radius e included 
in B(p, r), and consider such a family B(qi, e), i =, . . . , N E , of disjoints balls. 
By (ED, 

^e Q N £ < vol(B(p, r)) N e < Ke' Q vo\(B{p, r)). 
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On the other hand the union [J i B(qi, 2e) covers B(p, r), and by Theorem 14.71 
every ball B(qi, 2e) is of diameter > j^e if e is small enough. This implies 

S Q (B(p,r)) < lirrunf N £ < oo. 

Therefore dim% B(p,r) < Q. 

Conversely, let \J { B{q^ r^) be a covering of I?(p, r) with balls of diameter 
not greater than e. If e is small enough, every is smaller than e an d there 
holds 

vol(5(p,r)) < J2 yol ( B (Qi,n)) < Kj2 r ?- 

i i 

As a consequence, we have S®(B(p,r)) > vo\(B(p,r))/K, which in turn 
implies dim-u B(p, r) > Q. This ends the proof. □ 

When (M, d) is not equiregular, the Hausdorff dimension of balls centered 
at singular points behaves in a different way. Let us show it on an example. 

Consider the Martinet space (see Example I3.5p . that is, M. 3 endowed with 
the sub-Riemannian distance associated with the vector fields 

Xi = d x and X 2 = d y + —d z . 

2 

A point q = (x,y,z) is regular if x ^ and in this case Yli w i{ < i) = 4, 
otherwise it is singular and J2i w i( ( l) = 5. 

Lemma 4.10. Let p a point in the Martinet space. 

• If p is regular, then dim-^ B(p,r) =4, andl-L i {B{j) 1 r)) is finite. 

• If p is singular, then dim-^S(p, r) = 4, but l-C i {B{p,r)) is not finite. 

Proof. When p is regular, the result is a direct consequence of Theorem 14.91 
Let us consider a singular point p and a radius r > 0. Since regular points 
form an open set, B(p,r) contains small balls centered at regular points, 
and thus dim^ B(p,r) > 4. Moreover, it results from Corollary 14.81 that, for 
q = (x, y, z) close enough from p and for e > small enough, 

— £ 4 max(|x|,e) < vol(B(q, e)) < Ke A max(|a;|, e). (25) 
K 



We proceed as in the proof of Theorem 14.91 Define N e to be the maximal 
number of disjoints balls of radius e included in B(p,r), and consider such 
a family B{q^ e), i =,..., N £ , of disjoints balls, with ^ = (x^, Zj). Notice 
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that the first coordinate x is of nonholonomic order < 1 at any point; this 
implies that there exists a constant K' > such that 

B(q h e) C B(p,r) H {q = (x,y, z) : \x-Xi\<K'e}. 

As a consequence, for an integer k, every ball B(qi,e) such that (k — l)e < 
< fee is included in the set B(p,r) fl {g = (x,y,z) : |x| G ((A; — 1 — 
X')e, (A; + iiC')e]}. The volume of the latter set is smaller than K"e, where 
K" is a constant (depending neither on k nor e). Then it results from (J25]) 
that 

KN £ (k)ke 5 < K"e, 

where N e (k) is the number of points qi such that (k — l)e < \xi\ < ke. The 
Ball-Box Theorem implies that N e (k) = when k > K'r/e, and hence 

\K'r/e] \K'r/e\ , . 

/ j \ const 1 const , / 1 \ 

jv.= E — £ ^— io g ( ; j, 

k=l k=l v 7 

where \t \ denotes the integer part of a number t. Now the union [J^ -6(9i> 2e) 
covers B(p,r) and every ball B(qi,2e) is of diameter > -^e if e is small 
enough. This implies that, for any real number s > 4, 

S s (B(p,r)) < lim ^^fj N £ < lim const e s ~ 4 log Q = 0. 

Consequently dim-^ £>(p, r) < 4, and hence dim-^ B(p,r) = 4, since the con- 
verse inequality holds. 

We are left to show that % A (B{p, r)), or equivalently S 4 (B(p, r)), is not 
finite. Let [J i B(qi,ri) be a covering of B(p,r) with balls of diameter not 
greater than e. For an integer k > 1, denote by the set of indices such 
that U ie/ B(qi,ri) is a covering of the set B(p,r) fl {g = (x,y,z) : |s| G 
((Jfe- l)e, k Jfee]}. Thus 

vol(5(gj, rj)) > const e. 

On the other hand i G Ik implies vol(B(qi, ri)) < const r t 4 A;£:, and so 

const 



Summing up over k, we obtain, for a small enough e, 



r\ > const log ^-J 
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As a consequence, S^(B(p,r)) > const log(^), and so S^(B(p,r)) = oo. This 
ends the proof. □ 
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Appendix 



A Flows of vector fields 

This section is dedicated to the proof of Campbell-Hausdorff type formu- 
las for flows of vector fields. The result in Section IA.1I has been used in 
Section |2"TT} the one in Section fA. 21 will be necessary for the next section. 

Let U be an open subset of R n and VF(U) the set of smooth vector fields 
on U. Given a vector field X G VF(U), we denote its flow by exp(iX). 

A.l Campbell-Hausdorff formula for flows 

We will need in this section the Campbell-Hausdorff formula which we recall 
briefly here (for a more detailed presentation see for instance |Bou72l Ch. II]). 
Let x and y be two non commutative indeterminates, and [x, y] = xy — yx 
their commutator, also denoted by [x, y] = (adx)y. The length of an iterated 
commutator (acLci) • • • (adxk-i)xk, where each xi,...,Xk equals x or y, is 
defined to be the number of occurrences k of x and y. Define also e x and e v 
to be the series ^ fc>0 fr an d J2k>o &p Then we have e x e y = e H ( x ' y > in the 
sense of formal power series, where 

H{x,y) = x + y+ ~[x,y) + R(x,y), (26) 

and R(x, y) is a series whose terms are linear combination of iterated com- 
mutators of x and y of length greater than 2. For an integer N we denote by 
Hn(x, y) the partial sum of H (x, y) containing only iterated commutators of 
length not greater than N. In particular, H\ = x + y and H 2 = x+y+^[x, y]. 

Consider now two vector fields X, Y G VF(U). Given i G 1 and an 
integer N, H^{tY, tX) is a smooth vector field on U which writes as 5^»=i t % Yii 
where the vector fields Y±, . . . , belong to the Lie algebra generated by X 
and Y. 

Lemma A.l. Let p G M. There exist positive constants 5 and C such that 
\t\ < 8 implies 

||exp(tX) oexp(tY)(p) - exp(H N (tY,tX))(p)\\ < C\t\ N+1 . 

Proof. Set ip{t) = exp(tX) o exp(tY)(p), which is a function defined and 
C°° in a neighbourhood of G JR., and let (x%, . . . , x n ) be a system of local 
coordinates on a neighbourhood of p in U. We will compute the Taylor 



51 



expansion of every component Xi(ip(t)), for i — 1, . . . ,n. To do this, we 
introduce the function (j)(t, s) = exp(tX) o exp(sF)(p), so that ip(t) = <f>(t,t), 
and we compute the partial derivatives of Xi o cj> at G R 2 . We have: 

dx ■ o (h d 

—^(t, s) = - [ Xi o exp(tX)] (exp( S y)(p)) = X Xi ((f>(t, s)), 

where Xxi denotes the Lie derivative of x^ along X. Repeating this compu- 
tation, we obtain for any integer k, 



d k Xj o 



dt k 



it )S ) = x k x l m ) s)). 



In the same way, we have: 



d k+l xi o 
ds l dt k 



(0,0) 



d l d k Xi o 
ds 1 dt k 



-(0,s 

Ql 

ds 1 



\s=0 



[X k x t (exp(sY)(p))} 



s=0 



Y l X k Xi (p). 



We then deduce that the formal Taylor series of Xi(i/)(t)) = Xi((f>(t,t)) at is 



fk+l 



S m Ylxkx ^ p) 



k,l>0 



y-y 



l>0 



k>0 



where X and Y are considered as derivation operators. From the Campbell- 
Hausdorff formula, the product of the formal series e tY = Y2i>o ff^' with 

e tx = '^2k>oy.-^ k ^ s ec L ua l to the series e H ^ tY ' tx \ As a consequence, the 
Taylor expansion of Xi(ij}(t)) up to degree N is given by the terms of degree 
< iV in the series e H ( tY ' tx ^Xi(p), which coincide with the terms of degree < iV 
in the series e HN ^ tY,tx 'Xi(p). 

On the other hand, it results from Lemma [A. 21 below that e HN ^ tY,tx 'Xi(p) 
is the Taylor series at of the function t \-> Xi o exp(H]y(tY, tX))(p). Thus 



Xi o ,/;(t) - Xi o exp(H N (tY, tX))(p) = 0(\t\ N+1 ) 
for every coordinate x^, and the lemma follows. 



□ 



Lemma A. 2. Let Y\,...,Ye be vector fields on U, f : U — > R a smooth 
function, and p G U . The formal Taylor series at G R^ of the function 
Oi, ■ • ■ , Zi) i y /(expQ^ ZiYi)(p)) is given by 



fc>0 
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The formal Taylor series at G R of the function t \-> f(exp(Y(t))(p)), 
where Y(t) = E^ =1 ^ s 5wen ^2/ 

£^n*)*/(p) = e y(t) /(p)- (27) 

fe>0 

Proof. The second statement is obviously a consequence of the first one, 
so it is sufficient to prove the latter. We introduce the functions g(z) = 
/(expQ^ ZiYi){p)) and G(t,z) = g(tz) which are well-defined and smooth 
on a neighbourhood of in R^, respectively R x M. e . We are looking for the 
Taylor series of g at 0. 

Since G(t,z) = /(exp(t z;Yi)(p)), we have, for any integer k > 0, 

^(o, 2 ) = (E^) fc /w- 

i 

On the other hand G(t,z) = g(tz), and hence the previous derivative can 
also be computed as 

7>.*>= E « z "---" , aA ? (0). 

aiH \-ot£=k 1 



Combining both expressions, we obtain 



Z ai . . . Z °t Q k g \ f \ 

^ «!!■•■ a,! ^f 1 • • • (0) = Jfe! I ? ^ ) 

ha f =fc \ i / 



fe 

/to, 



«iH ha^=fc 

and the lemma follows. □ 

Lemma IA.1I can be extended in two ways. First, since the vector fields 
and their flows are smooth on U, the estimate holds uniformly with respect 
to p. Second, by Lemma IA.2[ the vector fields tX and tY may be replaced 
by the one-parameter families of vector fields X(t) = tX\ + • • • + t k X^ and 
Y(t) = tY\ + • • • + t l Ye, where X\, . . . , X^ and Yi, . . . , Ye are vector fields 
on U. As an example, H^(tY,tX) is of this form. To summarize, a slight 
change in the proof of Lemma [A. II actually shows the following result. 

Corollary A. 3. Let K C U be a compact. There exist two positive constants 
S,C such that, if p e K and \t\ < 5, then: 



\exp(X(t))oex V (Y(t))(p)-ex V (H N (Y(t),Y(t)))(p)\\ < C\t 



N+l 
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We are now in a position to prove formula ([H]), that we used in the proof 
of Lemma 12.11 Let X 1; . . . , X m be m elements of VF(U). For every multi- 
index I G {1, . . . ,m} k , k G N, we define the local diffeomorphisms <f>\ on U 
by induction on the length \I\ of I. Let (p\ = exp(£X;) and set, if I = iJ, 

Proposition A. 4. Let K C U be a compact and I a multi-index. There 
exist two positive constants 5,C such that, if p G K and \t\ < 5, then 

||^(p)-p-^Xj(p)|| < C\t\^ +1 . (28) 

Proof. For 5 > small enough, the mapping (p,t) h- > 4> t {jp) is defined and 
C°° on K x (—5,5). As a consequence, we are reduced to prove (1281 for a 
fixed p G A. 

When |/| = 1, 0f(p) = exp(tXi)(p) for some i G {l,...,m}, which is 
equal to p + tX^p) + 0(\t\ 2 ). 

Now, let / be a multi-index and iV > |/| an integer. Corollary I A . 3 1 implies 
that <#(p) = exp(H}f(t))(p) + 0(t N+1 ) where the series H^(t) is defined by 
induction: if / = % J, then 

Hx{t) = H N (H N (tXi,H J N (t)),H N (—tXi,—H J N (t))). 

Applying (j2B]) iteratively, we can write H^(t) = t |/| A / + tl / l +1 J R 7 (t), the latter 
term being a one-parameter vector field. As a consequence, 

4>l(p) = p + t^Xi(p) + terms of degree greater than \I\ + 1, 

which completes the proof. □ 

A. 2 Push-forward formula 

Given two vector fields X, Y G VF(U), we write (adX)F for [X, Y], (adX) 2 F 
for (adX)((adX)F), etc. 

Proposition A. 5. Let K C U be a compact, N a positive integer, and X , 
Y , Yi, . . . , Ye vector fields in VF(U). There exist two positive constants 5, C 
such that, if p G K, t G R and z6R' satisfy \t\ < 5 and \\z\\ < 5, then 

exp(*y).*(p) - Ef=o ^(adF) fe X(p) 

1 / \ ^ 

ex p(ELi - Ef= ^ ( ad ELi 

where exp(tY)*X = d(exp(tY)) o X o exp(— tF) denotes the push-forward of 
the vector field X by the diffeomorphism exp(tY). 



<C\t\ N +\ 



< C\\z\\ N+l , 
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Proof. Let us begin with the first inequality. Set <p p (t) = exp(tY)*X(p). 
For 5 > small enough, the mapping (p,t) (->■ p (t) is defined and C°° on 
K x (—5,5). As a consequence, there exists a constant C > such that, for 
every p G K and |t| < 5, we have 



N ik Jk, 



fc! cft fc 

fc=0 



< C\t 



N+l 



It remains to prove that ^^(0) = (adY) k X(p) for any integer k. Note first 
that <f) p (0) = X(p), and that 

is by definition equal to — LyX{p), where LyX is the Lie derivative of X 
along Y (see for instance |Boo 86j ) . Since LyX(p) = (adY)X(p), the cases 
k = and /c = 1 are done. 

We need now to compute -Jr(i) at i 7^ 0. Let us write (j) p {t + s) as 
exp(iF)* exp(sF)*X(p). We have 

^(t) = dMt d ^ s) 1=0= exp(ty).^ M«y)^ | s=0 (p) 

= exp(tF),((adF)X)(p). 

This derivative has the same form as <f> p (t), X being replaced by (adY)X. 

Iterating the argument above, we obtain by induction 4^(0) = (a,dY) k X(p), 
and the first inequality of the proposition is proved. 

As for the second inequality, the same reasoning applies and we only 
need to compute the partial derivatives at 6 l f of the function (f>(z) = 
ex P(J2i z iYi)*X{p). This can be done as in the proof of Lemma [A.21 The 
proposition follows. □ 

B Different systems of privileged coordinates 

This appendix is devoted to the proof that the examples of coordinates in- 
troduced in Section 13.21 are actually privileged coordinates. 

B.l Canonical coordinates of the second kind 

Let p G M and Y"i, . . . , Y n an adapted frame at p (see ( ITT]) , page [UJ). The 
map 

<p : (zi, . . . , z n ) h-)> exp(z n E n ) o • • ■ o exp(ziYi)(p) 
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is a local diffeomorphism near G M. n and its inverse defines some coordinates 
called canonical coordinates of the second kind near p. 
The following result is due to Hermes |Her91j . 

Lemma B.l. Canonical coordinates of the second kind are privileged at p. 

For sake of simplicity, we will write the compositions of maps as products; 
for instance, we write 

(f)(z) = exp(ziYi) ■ ■ ■ exp(z n Y n )(p). 

Proof. First, let us recall that is a local diffeomorphism at z = because 
its differential at is an isomorphism. This results from 

^(0) = j t (0(0, . . . , t, . . . , 0)) | t=Q = j t (eMmip)) U = Yib), 

for i = l,...,n. This computation also reads as 0*^-(p) = Y^p), which 
implies YiZi(p) = 1 (as in Section I3TT1 YiZi denotes the Lie derivative of the 
function Zi along the vector field Yj). Hence the order of Z{ at p is not greater 
than Wi. 

It remains to show that the order of Zj at p is at least Wi for each i = 
1, . . . , n. This is a direct consequence of the following assertion. 

Claim. Let X be one of the vector fields X\, ... ,X m . Then, for i = 1, ... ,n, 
the Taylor expansion at z = of the function a^z) = Xz; t (4>(z)) is a sum of 
homogeneous polynomials in the coordinates z of weighted degree > Wi — 1 . 
From the very definition of ai(z), we have 

X(4>(z)) = f>(*)|^). (29) 

i= 1 

Given z, let <p be the diffeomorphism defined on a neighbourhood of p by 
ip(q) = exp(ziFi) ■ ■ • exp(z n Y n )(q). In particular, ip{p) = (j){z). In order to 
obtain an equality in T P M, we apply the isomorphism (difp)^ 1 to both sides 
of (l!2"9~j) . and we get 

n f) 
i=i 1 

This equality is of the form W = XT=i a *^$> where the vectors W = W(z) 
and Vt = Vi{z), i = 1, . . . , n, belong to T P M. If we denote by b = b{z) G W 1 
the coordinates of W in the basis (Yi(p), . . . , Y n {p)) of T P M, and by P = P(z) 
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the (n x n)-matrix of the coordinates of Vi, . . . , V n in the same basis, then 
the vector a(z) = (ai(z), . . . , a n (z)) appears as the solution of Pa = b. 

Note first that -P(O) equals the identity matrix I. Therefore both matrices 
P{z) and P(z)~ l are equal to I+homogeneous terms of positive degree. Hence 
the Taylor expansion of ai{z) and the one of bi(z) have the same homogeneous 
terms of lower degree. 

On the other hand, since y? -1 = exp(— z n Y n ) ■ ■ -exp(— z{Yi), we have 

W(z) = exp(-z n Y n )* ■ ■ ■ exp(-ziKi)*X(p). 

Let us choose an integer N bigger than all the weights and apply Propo- 
sition [A]5] to exp(— ziYi)*X, then to exp(— ^Y^MadYi^X, and so on, 

N (-z V 1 

W(z) = exp(-z„r n ),---exp(-^r 2 )^^ T ^(adF 1 ) /l X(p) 

+0(\'i\ N+l ) 

= E ^^---t^(adr n )'"...(adn)^(p) + 0(1,0. 
h,..,i n =o Ll - Ln - 

Hence every coordinate b{{z) of W(z) satisfies 

h{z) = £ tp£...t^p i + o(\z\™), (30) 
ii,..,/„=o Lv Ln ' 

j3\ being the zth coordinate in the basis (Yi(p), . . . , Y n {p)) of the vector 
(a.&Y n ) ln ■ ■ ■ (adYi) Zl X(p). The latter vector belongs to A w (p), where w = 
1 + l\Wi + ■ ■ • + l n w n (recall that X 6 A 1 and Y { e A Wi ). Since (i^i, • • • , Y n ) is 
an adapted frame at p, (3\ is zero when 1 + liW\ + • ■ ■ + l n w n < Wi. It follows 
that bi(z) - and then a.i{z) - contains only homogeneous terms of weighted 
degree greater than or equal to Wi — 1. This ends the proofs of both the claim 
and the lemma. □ 

B.2 Canonical coordinates of the first kind 

Let jjgM and Y\, . . . , Y n an adapted frame at p. The map 

: (zi, . . . , z n ) i-)- exp^iFi H h z n Y n )(p) 
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is a local diffeomorphism near G M. n since its differential at is an isomor- 
phism. This results from 

§-(0) = ± (0(0, . . . , t, . . . , 0)) | t=0 = ± (ex P (^)(p)) U = X(P)> 

for i — 1, . . . , n. The inverse of defines some local coordinates near p called 
canonical coordinates of the first kind. 

Lemma B.2. Canonical coordinates of the first kind are privileged at p. 

The first proof of this lemma appeared in |RS76j , with a different formu- 
lation. The proof we present here is rather different. 

Proof. The proof follows exactly the same lines as the one of Lemma \B.1\ 
replacing (f> by 0, and ip by <p = exp(^V ZjYj). We are left to compute the 

coordinates bi(z), i = 1, . . . , n, of the vector W(z) = (^~ 1 ) i(! X(p) in the basis 
(Yi(p), . . . ,Y n (p)) of T p M. It results directly from Proposition IA.5I that 

N 1 / I \ k 

w^) = En ad E*^ x <p) + °(\*\ n+1 )> 

k=0 \ i=l J 

N 

= E E ^i 1 ---4 n ^(p) + o(N JV+1 )> 

k=0 h-\ \-l n =k 

where Zi belongs to A w (p), with w — 1 + ZitUi + • ■ ■ + Z n w n . Thus every 
coordinate 6,(2;) of W(z) satisfies 

JV 

& >) = E E ^■■■^M + o(\z\ N+i ), 

k=0 ZiH \-l„=k 

j5\ being the ith coordinate of Z\ in the basis (Yi(p), • • • , Y n (p)). This expres- 
sion is similar to (1301) . and the same conclusion follows. □ 



B.3 Algebraic coordinates 

Let us recall the construction of the algebraic coordinates (zi, . . . , z n ) given 
in page EH Let Y±, . . . , Y n be an adapted frame at p, and (y±, . . . , y n ) be local 
coordinates centered at p such that d Vi \ p = Yi(p). For j = 1, . . . , n, we set 

Wj-l 

z i = Vi ~ E hk ( yi > ' ' ' ' y i-d> ( 31 ) 

k=2 



58 



where, for k — 2, . . . , Wj — 1, 



h k ( yu . . . , yi _o = ^ rr • • • ^r 1 - E Mv)) (p) 



|a|=fc <?= 2 
io(a)<«)j 



with \a\ = ct\ + ■ ■ ■ + a r 
Lemma B.3. The algebraic coordinates (z±, . . . , z n ) are privileged at p. 

The proof of the lemma is based on the following result. 
Lemma B.4. A function f is of order > s at p if and only if 

(c-r/)(p) = o 

for all a such that w(a) < s. 

Proof. Let / be a function of order > s at p. Using the rules ([9]), we have 
ordp(Fj) > — Wi for i = 1, . . . , n, and hence ord p (Y" 1 Q:i • • • Y" n ) > —s for every 
a = («!,..., a n ) such that w(a) < s. Consequently, for such an a the 
function Y^ 1 ■ ■ ■ Y" n f is of positive order, and so vanishes at p. 

Conversely, let / be a function of order < s at p. We introduce the 
canonical coordinates of the second kind (x±, . . . ,x n ) defined by means of 
the adapted basis Y 1} . . . ,Y n . Proposition 13.21 implies that there exists a 
such that w(a) = ord p (/) < s and (d^ ■ ■ ■ d% n f)(p) ^ 0. Moreover, every 
vector field Y i: i = 1, . . . ,n, writes in coordinates x as 

n 

'S^Y? {x)d x ., where ord p (F/) > Wj — Wi. 

3=1 

There also holds 17(0) = 5^ since Y^p) = d Xi . As a consequence, 

w((3)<w(ct) 

and thus (Y"/* 1 • • • Y£ n f)(p) = (d^ ■ ■ ■ f) (p) ^ since w{a) = ord p (/). 
This ends the proof. □ 

Proof of Lemma \B. 31 Let i £ {1, . . . , n}. Note first that Y^(p) = Yiy^p) = 
1, which implies ord p (zi) < Wi. It remains to show that ord p (zi) > Wi. For 
this we will use the criterion of Lemma IB. 41 
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Let a such that w(a) < Wi (and so |a| < u>j). Using the expression f l3~Tj) 



of we obtain 



Wi— 1 



37* • • • y^ = F" 1 • • • r«« U - E h M 

\ k=2 

( H-i \ /toi-i \ 

= n ai • • • >r \vi - E ^fo) - • • • r « B Em») ■ (32) 

The functions hf~ are given by 

fc fc (l/) = E . . . r/- ( w - E Mi/)) (p) ^ • • • |=V 

w(f))<Wi 

Therefore, we clearly have (Y" 1 ■ ■ ■ Y° n hk) (p) = if k > \a\, and 

(y^ ■ ■ ■ yt Vi) ( P ) = yt • • • y n an \y t - E Mv) j W- 

Plugging this expression into ( 152]) . we obtain (Y"" 1 ■ ■ ■ Y^^Qo) = 0, which 
ends the proof. □ 
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